UNITED  STATES  AIR  FORCE 


SUMMER  RESEARCH  PROGRAM  -  1997 
SUMMER  RESEARCH  EXTENSION  PROGRAM  FINAL  REPORTS 

VOLUME  4A 

WRIGHT  LABORATORY 


RESEARCH  &  DEVELOPMENT  LABORATORIES 
5800  Uplander  Way 
Culver  City,  CA  90230-6608 


Program  Director,  RDL  Program  Manager,  AFOSR 

Gary  Moore  Major  Linda  Steel-Goodwin 


Program  Manager,  RDL  Program  Administrator,  RDL 

Scott  Licoscos  Johnetta  Thompson 

Program  Administrator,  RDL 
Rebecca  Kelly-Clemmons 


Submitted  to: 


AIR  FORCE  OFFICE  OF  SCIENTIFIC  RESEARCH 


Bolling  Air  Force  Base 
Washington,  D.C. 
December  1 997 


20010319  009 


MO  \-  O  /I  ft? 


PREFACE 


This  volume  is  part  of  a  five-volume  set  that  summarizes  the  research  of  participants  in  the  1997  AFOSR 
Summer  Research  Extension  Program  (SREP.) 

Reports  presented  in  this  volume  are  arranged  alphabetically  by  author  and  are  numbered  consecutively  - 
e.g.,  1-1, 1-2, 1-3;  2-1, 2-2, 2-3.  Reports  in  the  five-volume  set  are  organized  as  follows: 
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Infrared  Image  Registration  And  High  Resolution  Reconstruction  Using  Rotationally 

Translated  Video  Sequences* 

Mohammad  S.  Alam 

Associate  Professor  of  Electrical  Engineering 
Purdue  University,  ET  327A,  Fort  Wayne,  IN  46805-1499 

Abstract 

Microscanning  is  an  effective  technique  for  reducing  aliasing  and  increasing  resolution  in  images 
produced  by  staring  infrared  imaging  systems,  which  involves  recording  a  sequence  of  frames 
through  subpixel  movements  of  the  field  of  view  on  the  detector  array  and  then  interlacing  them 
to  produce  a  high  resolution  image.  The  forward  looking  infrared  (FLIR)  detector  arrays  is 
usually  mounted  on  a  moving  platform,  such  as  an  aircraft,  and  the  normal  vibrations  associated 
with  the  moving  platform  can  be  used  to  generate  shifts  in  the  FLIR  recorded  sequence  of  frames. 
Among  the  various  image  registration  algorithms,  the  algorithm  proposed  by  Irani  and  Peleg1  has 
shown  the  potential  of  being  an  extremely  fast  and  efficient  method  of  registering  a  sequence  of 
image  frames.  This  algorithm  has  been  demonstrated  to  work  with  non  uniformity-corrected, 
aliased  imagery  collected  from  a  3  to  5  micrometer  thermal  imaging  system.  However,  since  the 
algorithm  is  gradient-based  and  depends  on  its  ability  to  accurately  calculate  spatial  changes 
between  images,  its  performance  over  all  imaging  conditions  is  unknown.  For  example,  it  is 
expected  that  fixed-pattern  spatial  noise  will  produce  errors  in  the  gradient  estimate.  Other 
factors  that  may  affect  the  accuracy  of  registration  include  the  severity  of  aliasing  in  the  imagery, 
image  contrast,  and  image  size.  This  research  project  was  directed  to  investigate  the  effect  of  the 
aforementioned  parameters  on  the  image  registration  algorithm. 


*  When  this  project  was  funded  in  March  1997,  the  Air  Force  Lab  Focal  Point  (Brian  Yasuda,  WL/AAJT)  informed 
the  PI  that  another  group  at  the  Wright  Laboratory  completed  the  investigation  of  high  resolution  image 
reconstruction  using  rotationally  translated  frames  and  they  are  more  interested  in  investigating  the  effects  of 
various  degrading  factor  on  the  registration  algorithm  proposed  by  Irani  and  Peleg1.  After  detailed  discussion  with 
the  focal  point  and  other  group  members,  it  was  decided  to  investigate  the  performance  of  the  registration 
algorithm  under  various  degrading  factors  such  as  noise,  frame  size,  aliasing  and  image  contrast. 
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Infrared  Image  Registration  And  High  Resolution  Reconstruction  Using  Rotationally 

Translated  Video  Sequences 

Mohammad  S.  Alam 

Associate  Professor  of  Electrical  Engineering 
Purdue  University,  ET  327A,  Fort  Wayne,  IN  46805-1499 


1  Introduction 

In  many  infrared  imaging  systems,  the  detector  spacing  in  the  focal  plane  array  is  not  sufficiently 
small  so  as  to  sample  a  band-limited  scene  at  the  Nyquist  rate,  resulting  in  a  degraded  image  due 
to  aliasing  artifacts.  Over  the  last  five  years,  the  Electro-Optics  Branch  of  Wright  Laboratory  has 
been  developing  a  microscan  imaging  technique  to  increase  the  spatial  sampling  rate  of  existing 
focal  plane  arrays  and  to  alleviate  the  aliasing  effects  in  infrared  imagery2,3.  The  microscanning 
process  uses  a  sequence  of  spatially  undersampled  time  frames  of  a  scene  to  generate  the  high 
resolution  image.  Each  frame  is  subpixel  shifted  relative  to  each  other  frame  onto  a  set  grid 
pattern.  The  sequence  of  frames  are  then  interlaced  to  yield  the  high  resolution  image  which 
represents  the  input  scene  effectively  sampled  at  a  higher  spatial  frequency.  The  aforementioned 
process  is  called  controlled  microscanning  because  the  subpixel  shifts  between  the  temporal 
image  frames  are  controlled  and  therefore  known  a  priori.  Uncontrolled  microscanning  is  the 
process  where  the  shifts  for  each  frame  are  unknown  and  must  be  estimated  before  registering 
onto  the  high  resolution  grid  pattern.  The  shifts  for  each  frame  are  unknown  because  they  are 
generated  by  the  random  motion  and/or  vibration  of  the  imager  platform  such  as  an  aircraft  and 
not  by  a  microscan  mirror. 

Among  the  various  image  registration  algorithms  proposed  in  the  literature,  the  image 
registration  algorithm  reported  in  Reference  1  has  been  found  to  be  particularly  attractive.  This 
registration  algorithm  has  shown  the  potential  of  being  an  extremely  fast  and  efficient  method  of 
registering  a  sequence  of  image  frames.  The  algorithm  has  been  demonstrated  to  work  with  non 
uniformity-corrected,  aliased  imagery  collected  from  a  laboratory  3-  to  5-p.m  thermal  imaging 
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system.  However,  since  the  algorithm  is  gradient-based  and  depends  on  its  ability  to  accurately 
calculate  spatial  changes  between  images,  its  performance  over  all  imaging  conditions  is 
unknown.  For  example,  it  is  expected  that  fixed-pattern  spatial  noise  will  produce  errors  in  the 
gradient  estimate.  Other  factors  that  may  affect  the  accuracy  of  registration  include  the  severity  of 
aliasing  in  the  imagery,  image  contrast,  and  image  size.  Therefore,  an  evaluation  of  the 
performance  of  the  registration  algorithm  proposed  in  Reference  1  may  involve  determining  a 
method  to  characterize  the  algorithm’s  performance  under  various  levels  of  degradation.  The 
main  degrading  factors  to  be  examined  include  fixed-pattern  spatial  noise,  temporal  noise, 
aliasing,  contrast,  and  image  size.  A  measure  of  the  severity  of  the  degradation  under 
consideration  must  be  determined.  Plots  of  registration  error  versus  degradation  measure  may  be 
a  useful  indicator  of  general  trends.  It  may  also  be  informative  to  combine  the  degrading  factors 
in  some  manner  to  determine  the  total  effects  on  registration  error. 

To  investigate  the  effect  of  various  degrading  factors  on  above  mentioned  registration  algorithm1, 
a  450x450  pixel  infrared  image  data  collected  on  a  real  time  infrared  imaging  system  was  used  as 
the  high  resolution  image.  Since  all  parameters  of  the  imager  optics  and  detector  array  are 
known,  it  is  possible  to  accurately  model  the  optical  transfer  function  and  the  detector  transfer 
function  which  are  then  used  to  obtain  the  system’s  modulation  transfer  function  (MTF).  The 
high  resolution  image  was  blurred  by  the  system’s  MTF  and  a  sequence  of  nine  low  resolution 
frames  were  generated  from  this  blurred  image. 

Various  amounts  of  aliasing  and  noise  was  added  to  the  aforementioned  image  data  which  are 
then  used  as  input  to  the  registration  algorithm.  In  addition,  a  number  of  various  contrast  imagery 
has  been  produced  from  simulations  of  the  identical  image  to  correspond  to  different  times  of  day 
and  year.  The  contrast  of  the  imagery  was  adjusted  to  represent  the  contrast  in  typical  infrared 
imagery  as  closely  as  possible.  From  the  simulation  results,  it  is  evident  that  except  for  the 
spatio-temporal  noise,  the  registration  algorithm  is  susceptible  to  various  levels  of  registration 
error  with  respect  to  the  other  degrading  parameters  such  as  the  fixed  pattern  noise,  aliasing, 
image  size  and  image  contrast. 
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2  The  Sampling  Process 


2.1  Sampling  in  a  FLIR  Array 

In  a  FLIR  array,  sampling  is  performed  by  a  finite  sized  array  of  detectors  and  three  main  factors 
must  be  taken  into  consideration:  the  optical  point  spread  function,  the  detector  charge 
integration,  and  the  detector  array  geometry.  The  block  diagram  in  Fig.  1  illustrates  the  sampling 
process  in  a  FLIR  array.  The  object  scene,  denoted  o(x,y),  is  convolved  with  the  point  spread 
function  of  the  optics,  psf(x,y),  and  the  aperture  function  for  the  square  detector3,  d(x,y). 


Function  Integration  r(x,y)  samp(x,y) 


Finite  Detector  Sampling 
Array  Function 

Figure  1.  Block  diagram  of  the  imaging  system. 

For  our  imaging  system,  we  assume  the  detector  has  a  flat  response  across  its  active  region  so  the 
detector  function  can  be  expressed  as: 

*<■*■*> (1) 

where  a  and  b  are  the  dimensions  of  the  individual  detectors.  The  result  is  multiplied  by  a 
function  representing  the  limited  extent  of  the  detector  array,  r(x, y),  expressed  as 


r(x,  y)  =  rect 


(2) 
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where  X  and  Y  are  the  dimensions  of  the  array.  To  apply  this  integration  to  all  of  the  detectors,  it 
is  multiplied  by  the  sampling  lattice,  combAXiiy(x,y), 

10000/  \  00  00 

combXx  Av (x, y)  =~j£^  E  E  =  E  E  <?  (*  “ mAx> y ~ ”A>’)  (3) 

where  Ax  and  Ay  are  the  center-to-center  detector  spacing.  The  resulting  expression  for  the 
staring  image  is 


An  illustration  of  a  uniform  detector  array  showing  critical  dimensions  is  shown  in  Fig.  2. 


Y 


Figure  2.  The  detector  array. 

2.2  Controlled  Microscanning 

In  controlled  microscanning,  the  subpixel  shifts  between  temporal  image  frames  are  controlled 
and  therefore  known  in  advance.  A  level  L  microscan  is  defined  to  be  the  case  where  L2  staring 
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frames  are  acquired  for  each  high  resolution  frame  and  each  staring  frame  has  its  own  unique 
controlled  subpixel  shift,  each  shift  being  part  of  a  uniform  grid.  Thus,  each  staring  frame  has  a 
shift  that  is  integer  multiple  of  1/L  times  the  detector  width.  The  staring  frames  are  interlaced  in 
an  LxL  pattern  to  produce  a  high  resolution  frame  of  size  NLxNL  where  N  is  the  size  of  the 
square  detector  array.  For  a  level  2  or  2x2  microscanned  image,  the  original  scene  is  stepped  one 
half  the  length  of  the  detector  in  the  x  and  y  directions  producing  a  series  of  four  staring  images. 
These  images  are  then  interlaced  to  produce  the  resulting  microscan  image  of  2Nx2N  pixels,  as 
shown  in  Fig.  3  where  the  reconstructed  image  has  4  times  the  number  of  unique  samples  as  any 
of  the  four  frames.  Thus,  a  level-L  microscanning  effectively  increases  the  sampling  frequency 
by  L  without  changing  the  detector  size  or  spacing.  Since  the  effect  of  the  finite  detector  array 
r(x,y)  is  small  compared  to  the  effects  of  the  detector  integration  and  the  psf,  neglecting  the  effect 
of  r(x,y),  an  LxL  microscan  process  can  be  expressed  as 


i(x,y)  = 


o(x,y)i'  psf{x,yy-^—^rect 


f  x  y  ^ 


\a 


'b 


jrllcomb^ 
^  1=0  ;'= 0 


t  J 


(5) 


where  the  factor  of  1/L2  is  used  to  adjust  for  the  reduction  in  detector  integration  time  at  each  of 
the  L2  microscan  steps. 
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Frame  1 


Frame  2 


Figure  3.  Illustration  of  a  level  2  microscanning  process. 


2.3  Uncontrolled  Microscanning 


Uncontrolled  microscanning  results  from  practical  applications  where  the  imager  is  mounted  on  a 
moving  and/or  vibrating  platform,  such  as  an  aircraft,  and  the  vibrations  associated  with  the 
platform  can  be  exploited  to  create  the  shifts  in  the  acquired  frames.  In  uncontrolled 
microscanning,  the  shifts  for  each  recorded  frame  are  unknown  and  must  be  obtained  before 
forming  an  estimate  of  the  high  resolution  image.  Because  the  shifts  are  random  in  uncontrolled 
microscanning,  it  eliminates  the  need  for  a  microscan  mirror  and  driver  system  otherwise 
required  in  controlled  microscanning.  Since  a  fixed  number  of  frames  is  required  for  a  given 
level  of  microscanning,  and  the  shifts  are  random,  some  of  the  acquired  frames  may  have  almost 
similar  shifts  or  unusable  shifts  resulting  in  one  or  more  missing  frames.  The  missing  frames  will 
generate  empty  bins  in  the  high  resolution  image  reconstruction.  Thus,  the  key  factors  for  the 
high  resolution  image  reconstruction  are  the  accurate  knowledge  of  the  subpixel  translational 
motion  of  the  scene  relative  to  the  FLIR  array  and  the  accurate  estimation  of  the  missing  frames 


for  filling  the  empty  bins  in  the  high  resolution  image.  These  missing  frames  ma  be  estimated 
using  various  algorithms  such  as  the  maximum  likelihood  algorithm4,  maximum  a  posteriori 
algorithm5  or  the  weighted  nearest  neighbor  algorithm6  depending  on  trade-offs  such  as  the 
processing  speed. 

3  Image  Registration 

For  uncontrolled  microscanning,  the  movement  of  the  scene  on  the  array  is  produced  by  random 
motion  and/or  vibration  of  the  imager  platform.  Therefore  the  image  shifts  are  unknown  for  each 
recorded  frame.  It  is  necessary  to  develop  a  suitable  algorithm  which  can  accurately  estimate  the 
registration  parameters  i.e.,  in  our  case,  the  random  translational  shifts.  Various  algorithms  have 
been  reported  in  the  literature  for  estimating  the  image  registration  parameters1’7,8.  Among  these 
techniques,  the  gradient  based  registration  technique  proposed  in  Ref.  7  appears  to  be  particularly 
attractive  for  practical  applications.  However,  this  technique  only  works  for  small  shifts.  To 
accommodate  the  case  where  larger  shift  values  are  anticipated,  we  utilized  the  iterative 
technique  proposed  in  Reference  1  and  incorporated  it  with  the  registration  algorithm  proposed  in 
Reference  7.  In  this  algorithm,  the  first  acquired  frame  is  considered  to  be  the  reference  frame 
and  the  shifts  associated  with  the  remaining  frames  are  calculated  with  respect  to  the  reference 
frame.  If  p  represents  the  total  number  of  frames  acquired  by  the  imager,  and  hk  and  v*  represents 
the  shifts  in  the  horizontal  and  vertical  directions  for  the  kth  frame,  then  the  observed  kth  frame 
may  be  expressed  as 


ok{x,y)  =ol(x  +  hk  ,y  +  vk). 


(6) 


where  k  e  (2,  3,  4 . pj  and  for  the  reference  frame,  hj  =  vj  =  0.  Considering  the  first  three 

terms  for  the  Taylor  series  expansion,  Eq.  (6)  may  be  approximated  as 


,  dox{x,y)  dox{x,y) 
ok{x,y)  ~o{  (x,y)  +  hk - —  +vt  ^ 


(7) 
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Since  Eq.  (7)  is  an  approximation  itself  and  discrete  estimates  of  x  and  y  must  be  used,  it  is  useful 
to  apply  the  method  of  least  squares  for  solving  the  registration  parameters  hk  and  v*.  For  the  least 
squares  solution9  the  error  expression 


Ek(hk’vk) 


1 

MN 


M  N 

II 


m= 1  n= 1 


ok(m,n)-o,(m,n)  —h. 


dox(m,n )  dox{m,n) 


12 


dm 


-- v. 


dn 


(8) 


must  be  minimized,  where  m  and  n  are  discrete  variables  for  the  x  and  y  directions,  M  and  N 
represents  the  total  number  of  pixels  in  the  x  and  y  directions,  and  hk  and  v*  are  the  translational 
shifts  in  the  x  and  y  directions  between  the  Mi  frame  and  reference  frame.  Equation  (8)  can  be 
minimized  by  differentiating  Ek  ( hk  ,vk )  with  respect  to  hk  and  v*  and  setting  the  derivatives  equal 
to  zero.  This  yields  two  equations  which  must  be  solved  simultaneously  and  can  be  conveniently 
represented  in  the  following  matrix  form: 


M  N  (  3  -  / . ^^2 
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Equation  (9)  can  be  represented  in  a  short  form  as 


M  S  =V 


(10) 


where  S  =  [hk  v*  ]r. 
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Therefore,  the  estimated  registration  parameters  can  be  calculated  as 

S  -  M  “V .  (11) 

To  incorporate  the  case  involving  larger  shift  values,  we  follow  the  iterative  technique  of 
Reference  1.  At  first,  the  initial  registration  parameters  are  estimated  using  Eq.  (11).  Using  these 
estimated  shift  parameters,  Ok(m,n)  is  shifted  so  as  to  closely  match  oi(m,n).  The  resulting  image 
is  then  registered  to  oi(m,n).  Using  the  above  mentioned  procedure,  ok(m,n)  is  continuously 
modified  until  the  registration  estimates  become  sufficiently  small.  The  final  registration  estimate 
is  obtained  by  summing  all  of  these  partial  estimates. 

4  Performance  Evaluation  of  the  Registration  Algorithm 

4.1  Effects  of  Temporal  Noise 

Temporal  noise  can  be  defined  as  the  pixel  to  pixel  variations  that  are  temporally  and  spatially 
uncorrelated  from  frame  to  frame  when  the  FLIR  detector  arrays  is  observing  the  scene10'12. 
Temporal  noise  changes  with  each  new  frame  and  generally  appears  as  salt  and  pepper  noise.  It 
may  include  various  sources  such  as  readout  noise,  shot  noise,  quantization  errors,  etc.  which  can 
be  effectively  modeled  as  Gaussian  white  noise. 
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To  evaluate  the  effects  of  spatio-temporal  noise,  a  set  of  9  simulated  microscanned  images  or 
frames  was  generated  from  a  450x450  image  data  collected  on  a  real-time  infrared  imaging 
system  as  shown  in  Fig.  4.  The  FLIR  camera,  manufactured  by  Mitshubishi,  uses  a  512x512 
Platinum  Silicide  Schottky  Barriar  Charge  Sweep  detector  array  with  26x20  micrometer  center- 
to-center  spacing,  16x12  millimeter  chip  area,  13.3x10.2  millimeter  image  area,  and  39%  fill 
factor.  The  sensor  has  a  F#  1.2,  200  millimeter  focal  length  providing  3.5°  horizontal  and  2.75° 
vertical  field  of  view  with  a  response  in  the  3  to  5  micron  wavelength  band.  This  detector  array 
has  1.3%  quantum  efficiency  at  3  micrometers,  0.4%  quantum  efficiency  at  4  micrometers  and 
0.05%  quantum  efficiency  at  5  micrometers.  The  information  about  the  detector  and  lens 
configuration  was  used  in  the  formation  of  the  system  MTF. 

This  450x450  image  was  blurred  by  the  system  MTF  and  sampled  by  a  150x150  detector  array 
where  each  detector  was  simulated  with  a  3x3  kernel  which  returns  the  average  of  all  pixels  that 
fall  upon  it.  The  resulting  low  resolution  image  is  shown  in  Fig.  5.  The  set  of  9  low  resolution 
images  was  generated  using  a  prespecified  microscan  shift  pattern.  Then  Gaussian  white  noise 
was  added  to  each  frame  to  form  a  signal-to-noise  ratio  (SNR)  over  the  interval  of  .2  to  20.  A  low 
resolution  frame  with  added  noise  for  SNR  =  10  is  shown  in  Fig.  6.  It  should  be  mentioned  that 
for  a  given  value  of  SNR,  the  realization  of  noise  for  each  frame  was  different  from  other  frames. 
The  low  resolution  frames  were  then  input  into  the  shift  registration  algorithm  which  calculated 
the  shifts.  The  mean  absolute  error  (MAE)  between  the  shifts  calculated  by  the  registration 
algorithm  and  the  original  programmed  shifts  was  then  calculated  for  each  value  of  the  SNR.  Fig. 
7  shows  a  plot  of  the  MAE  as  a  function  of  the  SNR.  From  Fig.  7,  it  is  evident  that  the  MAE  of 
the  9  horizontal  and  9  vertical  registered  shifts  was  approximately  0.1  of  a  whole  low  resolution 
pixel  for  SNR  =  1 .  The  MAE  goes  down  further  for  higher  values  of  SNR  and  always  remains 
below  0.05  when  the  SNR  is  increased  over  2.  From  these  results,  we  conclude  that  the  shift 
registration  technique  is  very  robust  in  estimating  the  horizontal  and  vertical  registration 
parameters  under  extremely  low  SNR  conditions  (SNR  =  1).  This  performance  may  be  attributed 
to  the  uncorrelated  nature  of  the  noise,  and  the  least  squares  solution  provided  by  the  algorithm. 
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4.2  Effects  of  Fixed  Pattern  Noise 


Fixed  pattern  noise,  also  known  as  spatial  noise,  can  be  defined  as  the  pixel  to  pixel  variations 
that  are  temporally  correlated  from  frame  to  frame  when  the  infrared  detector  arrays  is  observing 
the  scene10'12.  Fixed  pattern  noise  arises  because  the  calibration  and  equalization  models  can  not 
adequately  model  actual  response,  or  the  drift  of  the  sensors  is  beyond  the  capability  of  practical 
recalibration  schemes.  Fixed  pattern  noise  appears  as  a  pattern  that  remains  fixed  on  the  monitor 
as  the  scene  passes  by.  In  general,  fixed  pattern  noise  in  scanning  arrays  will  have  the  appearance 
of  streaks  or  bands  depending  on  the  number  of  detectors  clustering  in  the  process.  The  pattern  of 
the  noise  may  be  fixed,  or  it  may  vary  between  images.  There  are  many  sources  of  fixed  pattern 
noise  such  as  pixel  1/f  noise,  array  1/f  noise,  nonlinear  pixel  nonuniformities,  additive 
nonuniformities,  spectral  nonuniformities  etc.  The  pixel  nonlinearities  are  usually  corrected  at 
calibration  time  but  residual  noise  persists  away  from  the  calibration  points.  Pixel  1/f  noise 
causes  each  pixel  to  drift  with  respect  to  the  other  pixels  on  the  array  in  a  spatially  uncorrelated 
manner.  Array  1/f  noise  is  related  to  instability  external  to  all  the  pixels  and  usually  introduce  a 
pattern  onto  the  image  where  the  pattern  itself  changes  very  little  in  time,  but  the  magnitude  of 
the  pattern  is  variable.  Spectral  nonlinearities  are  pixel  to  pixel  variations  in  the  spectral  response 
of  the  detector  and  can  be  corrected  by  matching  the  chromatic  content  of  the  calibration  with 
that  of  the  image. 

Thus  the  fixed  pattern  noise  is  very  dependent  on  the  type  of  sensor  and  is  difficult  to  model.  To 
obtain  insight  into  the  effects  of  fixed  pattern  noise,  it  is  assumed  that  the  fixed  pattern  noise 
remains  constant  during  the  time  it  takes  to  acquire  the  image  frames.  In  addition,  the  fixed 
pattern  noise  is  assumed  to  be  Gaussian  white  noise. 


The  450x450  image  used  in  the  previous  section  was  blurred  by  the  system  MTF  and  sampled  by 
a  150x150  detector  array  where  each  detector  was  simulated  with  a  3x3  kernel  which  returns  the 
average  of  all  pixels  that  fall  upon  it.  A  set  of  9  low  resolution  images  was  generated  using  a 
predefined  microscan  shift  pattern.  Then  Gaussian  white  noise  was  added  to  each  frame  to  form 
an  SNR  over  the  interval  of  .2  to  20.  A  low  resolution  frame  with  added  noise  for  SNR  =  20  is 
shown  in  Fig.  8.  It  should  be  mentioned  that  for  a  given  value  of  SNR,  the  realization  of  noise  for 
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each  frame  was  the  same.  The  low  resolution  frames  were  then  input  into  the  shift  registration 
algorithm  which  calculated  the  shifts.  The  MAE  between  the  registered  shifts  and  the 
programmed  shifts  was  then  calculated  and  plotted  as  a  function  of  the  SNR  as  shown  in  Fig.  9. 
From  Fig.  9,  it  is  evident  that  the  MAE  of  the  9  horizontal  and  9  vertical  registered  shifts  was 
approximately  0.1  of  a  whole  low  resolution  pixel  for  SNR  =  5.  The  MAE  goes  down  further  for 
increased  values  of  the  SNR.  From  the  aforementioned  results,  we  notice  that  the  fixed  pattern 
noise  has  a  more  deteriorating  effect  on  the  shift  registration  technique  when  compared  to  the 
effect  of  temporal  noise  discussed  in  the  previous  subsection. 

4.3  Effects  of  Aliasing 

An  standard  infrared  imaging  sensor  uses  a  fixed  two-dimensional  detector  array  which 
corresponds  to  fixed  spatial  frequency  determined  by  the  detector  pitch  or  spacing.  According  to 
the  Nyquist  theorem,  aliasing  will  occur  when  sampling  a  scene  containing  spatial  frequencies 
exceeding  half  of  this  sampling  frequency.  Because  most  natural  scenes  are  not  band  limited, 
aliasing  resulting  from  undersampling  can  significantly  degrade  the  utility  and  quality  of  the 
image.  Accordingly,  in  this  subsection,  we  investigate  the  effects  of  aliasing  on  the  shift 
registration  algorithm. 

The  450x450  high  resolution  image  shown  in  Fig.  4  was  blurred  by  the  detector  transfer  function. 
Then  four  different  image  sequences  were  constructed  using  25%,  36%,  56.25%  and  100%  fill 
factors.  Each  of  these  sequences  consists  of  9  low  resolution  frames  generated  using  a 
preselected  microscan  shift  pattern.  Then  Gaussian  white  noise  was  added  to  each  frame  to  form 
an  SNR  over  the  interval  of  .2  to  20.  It  should  be  mentioned  that  for  a  given  value  of  SNR,  the 
realization  of  noise  for  each  frame  was  the  same  i.e.,  fixed  pattern  noise  was  used.  The  low 
resolution  frames  were  then  input  into  the  shift  registration  algorithm  which  calculated  the  shifts. 
The  MAE  between  the  registered  shifts  and  the  programmed  shifts  was  then  calculated  and 
plotted  as  a  function  of  the  SNR  as  shown  in  Fig.  10.  From  Fig.  10,  it  is  evident  that  the  MAE  of 
the  9  horizontal  and  9  vertical  registered  shifts  was  approximately  0.1  of  a  whole  low  resolution 
pixel  for  SNR  =  5  if  the  fill  factor  is  100%.  On  the  other  hand,  the  MAE  of  the  registered  shifts 
was  approximately  0.22  of  a  whole  low  resolution  pixel  for  SNR  =  5  if  the  fill  factor  is  25%  -  an 
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increase  of  error  by  a  factor  of  2.2.  The  same  trend  of  results  were  obtained  for  the  other  fill 
factors.  Therefore,  as  the  fill  factor  is  decreased  the  MAE  increases  rapidly  which  implies  that 
the  MAE  of  the  shifts  registered  by  the  algorithm  increases  with  the  increase  in  aliasing. 
Furthermore,  with  the  increase  in  SNR  the  rate  of  decrease  in  the  MAE  with  smaller  fill  factors 
(such  as  25%)  is  much  lower  than  that  with  larger  fill  factors  (such  as  100%).  Thus,  as  the 
aliasing  increases,  the  shift  registration  technique  becomes  more  prone  to  increased  MAE. 

4.4  Effects  of  Image  Size 

As  mentioned  earlier,  the  shift  registration  algorithm  is  very  robust  in  estimating  the  horizontal 
and  vertical  (i.e.,  x-y)  registration  parameters  under  extremely  low  SNR  conditions.  This  is 
mainly  due  to  the  uncorrelated  nature  of  the  noise,  and  the  least  squares  solution  provided  by  the 
registration  algorithm.  Notice  that  the  least  squares  solution  effectively  averages  over  all  the 
pixels  in  the  images  being  compared  and  consequently  the  noise  effects  tend  to  average  out.  In 
cases  where  the  image  size  is  small,  it  is  expected  that  the  least  squares  solution  will  be  less 
accurate  and  the  registration  algorithm  may  not  yield  very  accurate  results. 

To  investigate  the  effect  of  image  size  on  the  shift  registration  algorithm,  we  considered  the 
450x450  image  used  earlier  which  was  blurred  by  the  system  MTF  and  sampled  by  a  150x150 
detector  array  where  each  detector  was  simulated  with  a  3x3  kernel  which  returns  the  average  of 
all  pixels  that  fall  upon  it.  A  set  of  9  low  resolution  images  was  generated  using  a  prescribed 
microscan  shift  pattern  where  each  frame  consists  of  150x150  pixels.  Then,  13  additional 
sequences  of  frames  were  generated  by  gradually  reducing  the  number  of  pixels  from  each  frame 
(such  as  140x140,  130x130,  120x120,  ...,  30x30,  and  20x20).  Each  set  of  low  resolution  frames 
were  then  input  into  the  shift  registration  algorithm  which  calculated  the  shifts  for  each  case.  The 
MAE  between  the  shifts  calculated  by  the  registration  algorithm  and  the  original  programmed 
shifts  was  then  calculated  and  the  peak  MAE  for  each  set  of  frames  was  recorded.  Fig.  1 1  shows 
a  plot  of  the  MAE  as  a  function  of  the  frame  size  where  the  numbers  along  the  X-axis  indicate 
the  frame  size  (e.g.,  a  number  20  indicates  the  size  of  each  frame  is  20x20).  From  Fig.  11,  it  is 
evident  that  the  MAE  of  the  9  horizontal  and  9  vertical  registered  shifts  was  less  than  0.1  of  a 
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whole  low  resolution  pixel  for  sequences  with  frame  sizes  30x30  and  above,  and  the  MAE  was 
less  than  0.04  of  a  whole  low  resolution  pixel  for  sequences  with  frame  size  larger  than  40x40.  It 
may  be  mentioned  that  the  aforementioned  results  were  obtained  in  the  absence  of  noise. 

Next,  we  investigated  the  effect  of  image  size  on  the  shift  registration  algorithm  in  the  presence 
of  fixed  pattern  noise.  We  considered  five  different  sets  of  frame  sequences  with  frame  size 
equaling  100x100,  80x80,  60x60,  40x40,  and  20x20  respectively.  Then  Gaussian  white  noise 
was  added  to  each  frame  to  form  an  SNR  over  the  interval  of  .2  to  20.  It  should  be  mentioned 
that  for  a  given  value  of  SNR,  the  realization  of  noise  for  each  frame  was  the  same.  The  low 
resolution  frames  from  each  set  were  then  input  into  the  shift  registration  algorithm  which 
calculated  the  shifts.  The  MAE  between  the  registered  shifts  and  the  programmed  shifts  was  then 
calculated  and  plotted  as  a  function  of  the  SNR  as  shown  in  Fig.  12.  From  Fig.  12,  it  is  evident 
that  as  the  number  of  pixels  in  each  frame  of  a  sequence  decreases,  the  MAE  increases  and  the 
rate  of  increase  is  higher  with  smaller  number  of  pixels  per  frame. 

4.5  Effects  of  Image  Contrast 

To  investigate  the  effect  of  image  contrast  on  the  registration  algorithm,  various  contrast  imagery 
produced  from  simulations  of  the  identical  scene  at  different  times  of  day  and  year  are  needed. 
Figure  13  shows  a  real  450x450  infrared  image  obtained  at  the  Wright  laboratory  using  the  3-5 
|im  thermal  imaging  system  discussed  earlier.  The  histogram  of  this  image  is  shown  in  Fig.  14a. 
Since  it  is  important  to  adjust  the  contrast  of  the  imagery  to  represent  the  contrast  in  typical 
infrared  imagery  as  closely  as  possible,  five  additional  images  were  created  from  this  image  by 
gradually  modifying  the  image  histogram  as  shown  in  Figs.  14b  through  14f.  A  set  of  9  low 
resolution  images  was  generated  from  each  of  these  images  (corresponding  to  Figs  14a  through 
14f)  using  a  priori  selected  microscan  shift  pattern.  Then  Gaussian  white  noise  was  added  to 
each  frame  to  form  an  SNR  over  the  interval  of  .2  to  20.  It  should  be  mentioned  that  for  a  given 
value  of  SNR,  the  realization  of  noise  for  each  frame  was  the  same.  The  low  resolution  frames 
from  each  set  were  then  input  into  the  shift  registration  algorithm  which  calculated  the  shifts.  The 
MAE  between  the  registered  shifts  and  the  programmed  shifts  was  then  calculated  and  plotted  as 
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a  function  of  the  SNR  with  image  contrast  and  variance  as  parameters  as  shown  in  Fig.  15.  We 
have  defined  image  contrast  as 

Contrast  =  ' ~  J (12> 

•*  max  min 


where  1^  refers  to  the  maximum  value  of  image  intensity  while  /min  refers  to  the  minimum  value 
of  image  intensity.  From  Fig.  15,  it  is  obvious  that  for  a  low  contrast  image  (i.e.  a  typical  infrared 
image),  the  MAE  is  considerably  high.  However,  the  variation  in  image  contrast  (corresponding 
to  the  image  histograms  of  Figs.  14a  through  14f)  has  little  impact  on  the  registration  algorithm. 
The  MAE  was  found  to  decrease  gradually  with  the  increase  of  SNR. 

5  Conclusion 

The  image  registration  algorithm  proposed  by  Irani  and  Peleg1  has  been  found  to  be  an  extremely 
fast  technique  when  compared  to  the  alternate  algorithms.  This  algorithm  has  been  demonstrated 
to  work  with  non  uniformity-corrected,  aliased  imagery  collected  from  a  laboratory  3-  to  5-p.m 
thermal  imaging  system.  However,  the  performance  of  this  algorithm  over  all  imaging  conditions 
is  unknown.  Accordingly,  in  this  research  project,  we  have  evaluated  the  performance  of  this 
registration  algorithm  under  various  levels  of  degrading  factors  including  the  fixed-pattern  spatial 
noise,  temporal  noise,  aliasing,  contrast,  and  image  size.  Plots  of  registration  error  versus 
degradation  measure  has  been  provided  in  order  to  indicate  the  general  trend  of  effects  of  each  of 
these  degrading  factors.  Among  these  degrading  factors,  the  temporal  noise  has  been  found  to 
have  the  least  deteriorating  impact  on  the  algorithm  while  the  other  factors  introduce  varying 
levels  of  errors  in  estimating  the  translational  shifts.  It  was  also  observed  that  the  image 
registration  error  or  the  MAE  decrease  gradually  with  the  increase  in  fill  factor  (i.e.,  a  decrease  in 
aliasing).  Furthermore,  the  image  registration  error  also  decreases  with  the  increase  in  image  size. 
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Figure  4.  A  450x450 


pixel  high  resolution  image  collected  on  a  real-time  infrared  imaging  system. 
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hcure  5.  A  150x150  low  resolution  ini  ace  collected  bv  the  detector  array  from  the  450x450  pixel 
high  resolution  image  where  each  detector  was  simulated  with  a  3x3  kernel  which  returns  the 
average  of  all  pixels  that  fall  upon  it. 
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Figure  6.  A  low  resolution  frame  with  added  noise  for  a  signal-to-noisc  ratio  (SNR)  of  10. 
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Registered  Shift  MAt 


Registered  Shift  MAE  vs  SNR 


SNR 


rc  7.  Plot  of  the  mean  absolute  error  (MAE)  as  a  function  of  SNR  showing  the  efiect  of 
o-temporal  no.se  on  the  registration  algorithm. 


Figure  8.  A  low  resolution  frame  with  added  noise  for  a  signal-to-noise  ratio  (SNR)  of  20. 
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Registered  Shift  MAE 


Registered 


Figure  9.  Plot  of  MAE  as  a  function  of  SNR 
registration  algorithm. 
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Registered  Shift  MAE  vs  SNR 


Figure  12.  Plot  of  MAE  as  a  function  of  SNR  with  frame  size  as  a  parameter  showing  the  effect  of 
different  frame  size  on  the  registration  algorithm  in  the  presence  of  fixed  pattern  noise. 
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Figure 
a  3-5  (I 


Registered  Shift  MAE 


Registered  Shift  MAE  vs  SNR 


Figure  15.  Plot  of  MAE  as  a  function  of  SNR  image  contrast  and  variance  as  parameters  showing 
the  effect  of  variation  in  image  contrast  on  the  registration  algorithm. 
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Abstract 

The  development  of  microstructure,  macrotexture,  and  microtexture  during  subtransus 
hot  working  of  Ti-6A1-4V  with  two  different  types  of  transformed  beta  starting 
microstructures  (lamellar  colony,  acicular/martensitic  alpha)  was  studied. 
Globularization  of  the  transformed  microstructures  required  heavy  rolling  reductions  or 
moderate  reductions  coupled  with  near  transus  post-rolling  heat  treatment.  Noticeable 
macrotexture  changes  were  noted  after  low  reductions,  an  effect  ascribed  to  the 
rotations  associated  with  kinking  and  bending  of  the  lamellar/acicular  plates. 

Noticeable  microtextures,  noted  in  samples  with  an  initial  lamellar  colony 
microstructure,  persisted  through  hot  rolling  suggesting  that  dynamic  globularization 
does  not  involve  recrystallization.  In  contrast,  hot  rolled  material  with  a  starting 
acicular  alpha  microstructure  exhibited  weak  microtextures  and  strong  macrotextures. 
The  absence  of  microtexture  in  these  latter  cases  was  explained  on  the  basis  of  multiple 
transformation  variants  within  each  colony/prior  beta  grain  following  the  beta- 
annealing-and-water-quenching  process  used  to  obtain  the  microstructure. 
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TEXTURE  .MICROSTRUCTURE  AND  MICROTEXTURE 
OF  HOT  ROLLED  Ti-6A1-4V 


Pnina  Ari-Gur 

1.  Introduction: 

Ti-6A1-4V  is  a  high-strength  light-weight  material  that  has  been  in  extensive  use 
for  aerospace  applications.  Its  processing  is  difficult  -  it  is  not  workable  at  low 
temperatures,  while  at  temperatures  above  75%  of  the  solidus  temperature,  its 
developing  microstructure  is  hard  to  control  [1]. 


At  room  temperature  and  up  to  990°C,  Ti-6A1-4V  has  a  dual  phase  (a+P)  structure. 
The  a-phase  (HCP),  the  dominant  phase  throughout  this  temperature  range,  is  highly 
anisotropic  in  nature.  The  second  phase  (P)  is  BCC,  and  is  the  single  phase  present 
above  the  transus  temperature  (990°C).  A  number  of  investigators  have  examined  the 
kinetics  of  the  globularization  of  alpha/beta  titanium  alloys  during  subtransus  hot 
working  [2-7] .  Most  of  these  efforts  have  focused  on  monotonic  deformation  involving 
modes  of  loading  such  as  uniaxial  compression,  tension,  or  simple  torsion.  However, 
most  mill  processing  relies  on  multipass  operations  involving  small  reductions  per  pass 
and  intermediate  reheating  and/or  annealing  steps.  Thus,  it  may  be  difficult  to  correlate 
laboratory  and  production  observations. 

Although  globularized  alpha  microstructures  may  be  free  of  mechanical  texture,  it 
is  well  known  that  a  considerable  degree  of  crystallographic  texture  may  be  produced 
[8,9].  Because  of  the  anisotropic  nature  of  the  hexagonal  alpha  phase,  such  textures 
may  affect  both  first-  and  second-tier  service  properties  such  as  elastic  modulus  and 
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fatigue  resistance  [8,10].  It  is  also  becoming  apparent  now  that  the  development  of 
microtexture,  or  large  regions  of  similarly  oriented  alpha  grains,  may  impact  second- 
tier  properties  such  as  dwell  fatigue  resistance  [11]. 

The  aligned  a  colonies  can  be  traced  back  to  the  original  p  (BCC)  grain  that 
existed  above  the  transus  temperature.  “Breaking”  the  original  microstructure,  as  well 
as  elimination  of  the  micro-texture  (local  aligned  colonies),  are  what  thermo-mechanical 
processing  [1]  is  aimed  at. 

The  goal  of  this  work  was  to  study  the  effects  of  initial  microstructure,  hot-rolling 
parameters  (temperature  and  percent  reduction),  and  post-rolling  heat  treatments  on  the 
evolution  of  microstructure,  crystallographic  texture,  and  micro-texture. 

2.  Experimental  Procedures 

2. 1  Material 

The  material  used  in  this  investigation  was  received  as  30  mm  thick  hot-rolled 
plate  of  Ti-6A1-4V.  Its  measured  composition  in  weight  percent  was  6.09  aluminum, 
3.95  vanadium,  0.18  iron,  0.19  oxygen,  0.02  carbon,  0.01  nitrogen,  and  0.010 
hydrogen,  with  the  balance  being  titanium.  The  beta  transus  temperature  was  995  °C. 

Samples  measuring  76  x  102  mm  were  cut  from  the  as-received  plate  and  given 
one  of  two  heat  treatments  -  beta  annealed  plus  controlled  cooled  or  beta  annealed  plus 
water  quenched.  The  former  treatment  hereafter  referred  to  as  BACC  comprised 
940°C/20  min.  +  1065°/15  min.  -I-  940°C/5  min.  +  815°C/15  min.  +  air  cool.  The 
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microstructure  developed  by  this  means  consisted  of  a  mixture  of  colony  and 
basketweave  alpha  with  the  prior  beta  grain  boundaries  decorated  with  a  layer  of  alpha 
(Figure  la).  The  prior-beta  grain  size  was  approximately  500  pm,  and  the  colonies 
were  approximately  250  pm  in  size.  Samples  given  the  beta-annealed-plus-water- 
quenched  treatment  (subsequently  referred  to  as  BAWQ)  were  heated  at  940°C/20  min. 
+  1065°C/15  min.  and  then  water  quenched.  This  treatment  produced  a  fine  acicular 
(martensitic  alpha)  microstructure  with  a  similar  prior-beta  grain  size  but  no  grain¬ 
boundary  alpha  (Figure  lb). 

2.2  Thermomechanical  Processing  (TMP) 

The  BACC  and  BAWQ  samples  were  processed  using  various  rolling  or  rolling- 
plus-heat-treatment  sequences  to  assess  the  effect  of  thermomechanical  treatments  on 
the  breakdown  of  the  transformed-beta  type  microstructures  and  the  development  of 
texture  and  microtexture.  Rolling  was  done  subtransus  at  temperatures  of  815,  925,  or 
970°C.  Samples  were  furnace  heated  for  40  minutes  prior  to  rolling  and  reheated  3 
minutes  between  each  pass.  The  rolling  speed  was  25  cm/s,  and  the  reduction  per  pass 
was  12  percent  (two  higher  rolling  temperatures)  or  5  percent  (lowest  rolling 
temperature).  Rolling  was  continued  until  a  total  thickness  reduction  of  either  44  or  77 
percent  (corresponding  to  thickness  strains  of  0.57  and  1.47  or  effective  strains  of  0.66 
and  1.69,  respectively)  had  been  achieved.  Following  rolling,  all  samples  were  air 
cooled  to  simulate  production  practice. 

To  determine  the  effect  of  static  versus  dynamic  processes  on  globularization  and 
or  recrystallization  of  the  transformed  beta  microstructures  and  the  concomitant  texture 
and  microtexture  evolution,  samples  cut  from  the  hot  rolled  sheets  were  given  either  a 
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mill  anneal  or  a  higher  temperature  solution  annealing  treatment.  The  mill  anneal  was 
conducted  at  700°C  for  one  hour  followed  by  air  cooling;  this  treatment  did  not  lead  to 
noticeable  changes  in  microstructure  but  was  necessary  to  relieve  residual  stresses,  etc., 
for  the  successful  application  of  the  orientation  imaging  technique  for  the  determination 
of  microtexture.  Solution  heat  treatments  were  conducted  for  two  hours  at  temperatures 
between  815  °C  and  970°C  followed  by  air  cooling.  Details  of  the  post-processing  heat 
treatments  are  given  in  Table  1 . 


Table  1:  Solution  Annealing  Treatments 


Group  Number 

Temperature 

Time 

Cooling 

1 

Same  as  rolling 

2  hours 

Air 

2 

870°C 

2  hours 

Air 

3 

955°C 

2  hours 

Air 

2.3  Post-TMP  Characterization 

Following  rolling  or  rolling  plus  heat  treatment,  microstructure,  macrotexture, 
and  microtexture  were  determined.  Microstructure  was  quantified  through  optical 
metallography  on  thickness  cross-sections  cut  parallel  to  the  rolling  direction. 
Crystallographic  texture  and  microtexture  were  determined  for  the  alpha  phase.  For  the 

determination  of  global  texture  (or  so-called  macrotexture),  standard  (0002)  pole 
figures  were  measured  via  a  back  reflection  technique  using  copper  Ka  radiation. 

Samples  for  such  measurements  were  prepared  by  grinding  and  polishing  25  x  25  mm 
coupons  to  the  midplane.  For  ease  of  visualization  of  the  primary  macrotexture 
components,  the  pole  figures  were  processed  through  the  program  popLA  [12]  to  obtain 
graphic  representations  of  the  components.  The  local  (spatial)  variation  of  texture  (or 
microtexture)  was  obtained  by  analysis  of  electron  backscatter  patterns  (EBSP)  in  a 
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scanning  electron  microscope  [13,14]  For  this  purpose,  thickness  cross-sections  parallel 
to  the  transverse  direction  were  cut  and  prepared  metallographically.  A  region  close  to 
the  rolled  surface  and  one  at  the  midplane  were  analyzed.  In  each  case,  the  area 
sampled  was  approximately  4  mm^  over  which  40,000  discrete  measurements  were 
made.  These  microtexture  data  were  represented  in  the  form  of  icosahedral  and  total 
Eulerian  plots  delineating  the  orientation  of  the  (0002)  basal  poles  so  determined. 
Icosahedral  plots  are  generated  by  dividing  the  spherical  projection  of  the  offset  angle 
between  the  (0002)  poles  and  the  fixed  sample  axes  into  10  regions.  Each  region  is 
represented  by  a  different  color,  so  identical  color  indicates  similar  orientations.  Points 
where  indexing  is  not  possible  (“no  hit”)  are  white.  Total  Eulerian  plots  are  essentially 
similar  to  the  icosahedral  plots,  except  that  there  are  thousands  of  color  possibilities 
instead  of  10.  The  background  for  the  total  Eulerian  plots  is  black. 

3.  Results  and  Discussion 

3. 1  Microstructure 

The  as-rolled  microstructures  of  the  BACC  and  BAWQ  specimens  are  shown  in  Figures 
2  and  3.  For  the  BACC  material,  rolling  to  the  lower  reduction  s  =  0.66  gave  rise  to 
relatively  little  globularization.  The  majority  of  the  microstructure  was  characterized 
by  bent,  higher  aspect-ratio  alpha  plates  (Figure  2b).  In  contrast,  rolling  to  a  higher 
reduction  a  =  1.69  yielded  a  sizable  proportion  of  alpha  grains  with  aspect  ratios  of 
less  than  3:1  at  all  three  rolling  temperatures  (815,  925,  970°C)  as  shown  in  Figures  2 
a,  c,  e.  By  and  large,  the  effect  of  reduction  and  rolling  temperature  on  the 
globularization  of  BAWQ  specimens  (Figure  3)  was  very  similar  to  the  observations  for 
the  BACC  samples.  This  is  particularly  evident  for  samples  rolled  at  970 °C  (Figures  3 
c,  d  vs.  Figures  2  d,  e);  the  resemblance  between  the  two  sets  of  microstructures  at 
970  °C  may  be  ascribed  to  continuous  coarsening  of  the  acicular  alpha  plates  which 
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occurs  very  rapidly  during  preheating  at  this  temperature  [15].  The  coarsening  kinetics 
are  somewhat  slower  at  925 °C.  Thus,  the  globularized  microstructure  in  BAWQ 
samples  rolled  at  925  °C  (Figure  3b)  was  somewhat  finer  than  that  in  the  corresponding 
BACC  condition  (Figure  2c). 

The  effect  of  post-rolling  heat  treatment  on  the  degree  of  globularization  is 
illustrated  in  Figures  4,  5  and  6.  Little  increase  in  globularization  is  evident  after  two- 
hour  heat  treatments  at  870°C  (Figures  4  and  5)  when  compared  to  the  as-rolled 
microstructure  (Figure  2).  On  the  other  hand,  samples  which  were  only  lightly  rolled 
at  925  °C  were  globularized  to  a  large  extent  after  a  two-hour  heat  treatment  at  925  °C 
(Figures  6b,  c). 

3. 2  Macrotexture  Development 

The  (0002)  pole  figures  measured  for  the  as-heat-treated  BACC  and  BAWQ 
samples  are  shown  in  Figure  7.  Both  materials  exhibited  a  pronounced  macrotexture. 

In  times  random  units,  the  BACC  had  three  principal  texture  components  -  TD  (6X),  a 
so-called  45°  component  (8X),  and  RD  (2X)  -  and  the  BAWQ  had  two  components  - 
TD  (16X)  and  45°  (10X),  where  RD  and  TD  refer  to  rolling  direction  and  transverse 
direction,  respectively.  Because  of  the  large  beta  grain  size  and  hence  moderate 
number  of  colonies  inspected  to  obtain  the  pole  figures,  the  textures  may  seem  stronger 
than  they  actually  are.  Therefore,  the  relative  intensities  may  be  somewhat 
approximate  and  should  thus  be  interpreted  in  semi-quantitative  terms.  Nevertheless, 
there  is  no  doubt  that  annealing  the  Ti-6A1-4V  plate  in  the  beta  field  did  not  result  in  a 
random  macrotexture. 
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The  present  beta-annealed  pole  figures  (Figure  7)  can  be  compared  to  those 
measured  by  previous  investigators.  Moustahfid,  et  al.  [16]  utilized  a  Ti-6A1-4V  sheet 
material  which  had  been  processed  through  a  series  of  thermomechanical  treatments 
including  a  final  cold  rolling  step  and  mill  anneal.  Prior  to  beta  annealing,  this  material 
exhibited  a  strong  basal  texture  ((0002)  poles  parallel  to  the  sheet  normal).  After  beta 
annealing,  the  alpha  phase  exhibited  two  very  strong  components  in  the  (0002)  pole 
figure,  an  RD  component  and  a  component  between  the  sheet  normal  and  RD. 
Moustahfid,  et  al.  [16]  concluded  that  some  complex  form  of  texture  variant  selection 
must  have  occurred,  because  the  six  possible  variants  characterizing  the  alpha-to-beta 
transformation  and  the  twelve  variants  for  the  beta-to-alpha  transformation  would  not 
have  produced  such  strong  textures  in  the  absence  of  such  a  variant  selection.  In 
contrast  to  the  findings  of  Moustahfid,  et  al..  Peters  and  Luetjering  [17]  found  that  beta 
annealing  followed  by  water  quenching  eliminated  the  alpha-phase  texture  of  Ti-6A1-4V 
plate  material  that  had  been  produced  by  beta  forging. 

The  alpha-phase  macrotextures  of  the  BACC  and  BAWQ  samples  after  hot  rolling 
are  summarized  in  Figure  8.  These  (0002)  pole  figures  reveal  that  the  major  texture 
components  were  the  TD  and  RD  ones,  whereas  basal  and  45°  textures  were  found  in 
only  a  few  cases.  The  trends  are  quantified  by  the  data  in  Table  2,  which  shows  the 
predominance  of  the  TD  component  in  all  cases.  The  TD  component  in  the  rolled 
materials  may  be  thought  to  have  evolved  from  the  TD  component  present  in  the  as- 
heat  treated  materials.  It  is  also  possible  that  the  TD  component  could  come  from  the 
decomposition  of  hot  worked  beta  phase  as  noted  by  Gey,  et  al.  [18]. 

The  results  in  Table  2  also  reveal  the  noticeable  absence  of  a  45°  texture 
component  (except  for  low  reduction  hot  rolling  of  BAWQ  material  at  925 °C).  As 
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noted  above,  both  the  BACC  and  BAWQ  samples  had  strong  45°  components  in  the  as- 
heat-treated  condition.  The  absence  of  the  45 0  component  is  even  more  remarkable  in 
view  of  its  elimination  after  relatively  low  deformation  levels.  This  observation  may  be 
explained  on  the  basis  of  the  kinking  and  bending  of  lamellar  plates  at  relatively  low 
strains  noted  in  Section  3.1.  Such  kinking  could  give  rise  to  noticeable  plate  rotations 
and  thus  texture  changes. 

Table  2.  Pole  Figure  Measurements  of  Texture  Components  Developed  in  Ti-6A1-4V 


Material 

Condition 

Rolling 

Temperature  (°C) 

Reduction* 

Texture  Components 

BACC 

As  Heat  Treated 

TD  (6X),  45°  (8X),  RD  (2X) 

BACC 

Hot  Rolled 

815 

High 

TD  (9X),  Basal  (2X) 

BACC 

Hot  Rolled 

925 

Low 

TD  (6X),  RD  (4X) 

BACC 

Hot  Rolled 

925 

High 

TD  (8X),  RD  (3X) 

BACC 

Hot  Rolled 

970 

Low 

TD  (8X),  RD  (6X) 

BACC 

Hot  Rolled 

970 

High 

TD  (8X),  RD  (3X)  - 

BAWQ 

As  Heat  Treated 

TD  (16X),  45°  (10X) 

BAWQ 

Hot  Rolled 

925 

Low 

TD  (8X),  45°  (2X) 

BAWQ 

Hot  Rolled 

925 

High 

TD  (8X),  RD  (3X) 

BAWQ 

Hot  Rolled 

970 

Low 

TD  (4X),  RD  (4X) 

BAWQ 

Hot  Rolled 

970 

High 

TD  (9X),  RD  (4X) 

*  Low  reduction  =  44  pet,  high  reduction  =  77  pet. 


The  development  of  RD  texture  components  in  the  as-hot  rolled  materials  (Table 
2)  may  also  be  controlled  by  texture  changes  associated  with  lamellar  plate  kinking  and 
bending.  For  example,  the  BACC  samples  which  had  a  2X  RD  component  in  the  as- 
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heat  treated  condition,  showed  a  noticeably  stronger  RD  component  after  light 
reductions.  At  the  higher  deformation  levels,  however,  the  transient  nature  of  RD 
texture  was  manifested  by  a  decreasing  sharpness  while  the  TD  component  remained 
relatively  sharp. 

The  almost  total  absence  of  the  basal  texture  components  in  the  rolled  Ti-6A1-4V 
samples  contrasts  sharply  with  the  observations  of  Peters  and  Luetjering  [17],  who 
measured  noticeable  components  of  this  type  following  subtransus  hot  rolling  over  a 
wide  temperature  range.  This  difference  may  be  as  least  partly  due  to  different  textures 
prior  to  hot  rolling.  Further  work  is  needed  to  establish  the  exact  source  of  the 
difference  in  macrotexture. 

The  effect  of  post-rolling  heat  treatment  on  the  macro-texture  is  illustrated  in 
Figures  9  and  10.  The  TD  component  is  still  evident  after  the  heat  treatment  and  in 
addition  the  45°  component  reappears. 

3.3  Microtexture  Evolution 

The  EBSP  measurements  to  quantify  the  alpha  phase  microtexture  provide 
additional  insight  into  the  interdependence  of  micro  structural  and  textural  evolution. 
The  microtexture  results  to  be  discussed  pertain  solely  to  the  midplane  region  of  the 
plate/sheet  samples,  i.e.,  the  location  at  which  the  pole  figures  had  been  measured. 

Figure  1 1  summarizes  the  alpha-phase  microtextures  of  the  as-heat  treated  BACC 
and  BAWQ  materials  in  both  icosahedral  and  total  Eulerian  forms.  The  BACC  material 
(Figures  11a,  b)  showed  very  distinct  patches  of  various  colors.  These  patches  are 
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approximately  250  pm  in  size  and  can  be  correlated  to  the  colony  features  in  the  optical 
microstructure  (Figure  la).  The  uniformity  of  the  color  within  a  given  patch  verifies 
that  all  the  alpha  plates  within  it  have  the  same  orientation  as  expected.  In  addition,  the 
irregular  contours  of  the  patches  bear  a  resemblance  to  the  irregular  outlines  of  the 
colonies  thus  verifying  that  they  are  indeed  representative  of  these  microstructural 
features  rather  than  the  prior-beta  grains  whose  shapes  are  largely  equiaxed.  In 
contrast,  the  BAWQ  microtexture  showed  less  distinct  patches  of  color  (Figure  11c,  d). 
This  can  be  ascribed  to  the  fact  that  each  colony  (or  beta  grain)  contains  several 
different  variants  of  acicular  plates.  Despite  the  fact  that  the  as-heat  treated  BAWQ 
material  did  not  appear  to  have  a  noticeable  microtexture,  the  fact  that  it  exhibited  a 
strong  macrotexture  (Figure  7b,  Table  2)  indicates  that  relatively  few  variants  were 
selected  during  the  beta-to-alpha  prime  transformation. 

After  hot  rolling,  the  BACC  material  retained  evidence  of  microtexture. 

Examples  from  samples  rolled  to  a  large  reduction  at  815  or  925°  C  are  shown  in  Figure 
12.  In  both  cases,  the  color  patches  have  an  elongated  shape.  Bearing  in  mind  that  the 
EBSP  measurements  were  taken  on  cross  sections  containing  the  transverse  direction 
and  sheet  normal,  this  elongation  is  as  expected  for  the  plane-strain  reduction 
characterizing  rolling.  To  be  specific,  after  a  77  percent  reduction  (i.e. ,  the 
deformation  in  the  heavy  reduction  experiments),  a  colony  measuring  originally  250  x 
250  pm  in  a  transverse  cross  section  would  deform  to  a  size  of  approximately  60  x  250 
pm,  in  rough  agreement  with  the  observations.  The  plots  in  Figure  12  also  reveal  that 
the  deformed  colonies  exhibit  relatively  few  different  colors.  This  behavior  can  be 
explained  on  the  basis  of  the  development  of  a  sharp  macrotexture  (including  the 
disappearance  of  the  45 0  macrotexture  component)  which  would  lead  to  a  reduction  in 
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the  overall  spread  of  orientations  and  thus  number  of  different  colors  that  would  be 
evident  on  icosahedral  or  Eulerian  plots. 

Taken  together,  the  microstructure,  macrotexture,  and  microtexture  data  for  the 
hot  rolled  BACC  Ti-6A1-4V  material  provide  insight  into  the  globularization  process 
during  hot  working.  The  results  suggest  large  macrotexture  changes  at  low  strains  due 
to  plate  bending/kinking  but  with  limited  globularization  of  the  plates.  Because  a 
microtexture  is  retained  to  high  strains,  it  can  be  inferred  that  discontinuous 
recrystallization  (which  would  tend  to  randomize  the  orientations  within  a  given  colony) 
is  probably  not  occurring  to  a  significant  extent.  Rather,  strain  localization  processes 
within  the  alpha  plates  and  subsequent  penetration  of  the  beta  phase,  as  described  by 
Weiss,  et  al.  [3,21],  can  be  deduced  to  be  the  mechanism  by  which  globularization 
would  proceed  while  maintaining  the  similar  identity  of  globularized  alpha  grains 
within  a  prior  colony. 

Post-rolling  heat  treatment  of  BACC  samples  resulted  in  a  significant  change  in 
microtexture,  but  not  necessarily  a  reduction  in  its  prominence.  For  example,  the 
evolution  of  microtexture  for  a  BACC  sample  that  was  lightly  rolled  at  925  °C  and 
subsequently  heat  treated  at  925  °C  is  shown  in  Figure  13.  There  was  not  a  noticeable 
reduction  in  size. 

Microtexture  developed  in  a  BAWQ  sample  via  rolling  to  a  heavy  reduction  at 
970 °C  is  shown  in  Figure  14.  Similar  to  the  as-heat  treated  condition  (Figure  11c,  d), 
the  rolled  BAWQ  samples  did  not  exhibit  a  very  strong  microtexture.  Furthermore,  as 
for  the  heavily  rolled  BACC  samples,  the  microtexture  plots  for  the  rolled  BAWQ 
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samples  revealed  few  colors  indicating  a  narrow  spread  in  orientations,  a  trend  that 
would  be  expected  based  on  the  development  of  a  strong  macrotexture. 

4.  Conclusions 

The  evolution  of  microstructure,  macrotexture,  and  microtexture  during  the 
thermomechanical  processing  of  Ti-6A1-4V  with  transformed  beta  starting 
microstructures  was  determined.  The  following  conclusions  are  drawn  from  this  work: 

1.  Moderately  high  reductions  ('  75  pet.)  are  required  to  globularize  colony  or 
acicular  (martensitic  alpha)  microstructures  dynamically  via  hot  rolling.  Lower 
reductions  ('  40  pet.)  in  conjunction  with  near  subtransus  post-rolling  heat  treatments 
may  also  be  used  to  effect  globularization. 

2.  Beta  annealing  followed  by  controlled  cooling  (to  form  a  lamellar  colony 
structure)  or  water  quenching  (to  form  martensitic  alpha)  does  not  produce  a  random 
macrotexture. 

3.  The  principal  alpha  macrotexture  component  developed  during  unidirectional  hot 
rolling  consists  of  one  for  which  the  basal  poles  are  aligned  transverse  to  the  rolling 
direction.  The  predominance  of  this  TD  texture  component  even  at  low  reductions 
suggests  that  kinking  and  bending  of  lamellar  plates  at  low  strains  play  a  key  role  in  the 
development  of  macrotexture  and  the  occurrence  of  flow  softening. 

4.  The  persistence  of  noticeable  microtexture  during  the  subtransus  hot  rolling  of  Ti- 
6A1-4V  with  a  lamellar  colony  starting  microstructure  suggests  that  globularization  of 
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the  alpha  plates  does  not  occur  by  recrystallization  but  by  localized  deformation  of  the 
plates  and  penetration  by  the  beta  phase. 

5.  Strong  macrotextures  in  Ti-6A1-4V  with  transformed  beta  or  globularized  alpha 
microstructures  can  be  associated  with  either  weak  or  strong  microtextures  depending 
on  the  nature  of  the  transformed  beta  (i.e.,  lamellar  colony  versus 
basketweave/acicular) . 
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Figure  1 .  Optical  microstructures  of  Ti-6A1-4V  program  materials:  (a)  beta  annealed  and 
controlled  cooled  (BACC),  (b)  beta  annealed  and  water  quenched  (BAWQ). 
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Figure  2.  Optical  microstructures  developed  in  hot  rolled  Ti-6A1-4V  BACC  samples. 

Rolling  temperatures  and  total  reductions  (in  percent)  were:  (a)  815°C/77,  (b) 
925°C/44.  (c)  925°C/77,  (d)  970°C/44.  and  (e)  970°C/77.  In  all  micrographs,  the 
rolling  direction  is  horizontal,  and  the  thickness  direction  is  vertical. 


Figure  3.  Optical  microstructures  developed  in  hot  rolled  Ti-6A1-4V  BAWQ  samples. 

Rolling  temperatures  and  total  reductions  (in  percent)  were:  (a)  925°C/44, 
(b)  925  C/77,  (c)  970°C/44,  and  (d)  970°C/77.  In  all  micrographs,  the 
rolling  direction  is  horizontal,  and  the  thickness  direction  is  vertical. 
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Figure  4.  Optical  microstructures  developed  in  hot  rolled  Ti-6A1-4V  BACC  samples, 
after  solution  annealing  at  870°C.  Rolling  temperatures  and  total 
reductions  (%)  were:  (a)  815°C/77,  (b)  925°C/77,  (c)  925°C/44,  and  (d) 
970° C/44.  In  all  micrographs,  the  rolling  direction  is  horizontal,  and  the 
thickness  direction  is  vertical. 
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Figure  5.  Optical  microstructures  developed  in  hot  rolled  Ti-6A1-4V  BAWQ  samples, 
after  solution  annealing  at  870°C.  Rolling  temperatures  and  total 
reductions  (%)  were:  (a)  970°C/77,  (b)  925°C/77,  (c)  925°C/44,  and  (d) 
970° C/44.  In  all  micrographs,  the  rolling  direction  is  horizontal,  and  the 
thickness  direction  is  vertical. 
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Figure  6.  Optical  microstructures  developed  in  hot  rolled  Ti-6A1-4V  samples,  after 
solution  annealing  at  a  the  same  temperature  as  their  rolling  temperature. 
Preform  microstructure/rolling  temperature/total  reduction  (in  percent)/heat 
treatment  temperature  were  as  follows:  (a)  BACC/815°C/77/815°C,  (b) 

B ACC/925 0 C/77/925  °C ,  (c)  BAWQ/925°C/44/925°C,  (d) 
BAWQ/970°C/44/970°C.  In  all  micrographs,  the  rolling  direction  is 
horizontal,  and  the  thickness  direction  is  vertical. 


Figure  7.  (0002)  alpha-phase  pole  figures  for  Ti-6A1-4V  rolling  preform  material: 
(a)  BACC  condition,  (b)  BAWQ  condition. 
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Figure  9.  (0002)  alpha-phase  pole  figures  for  hot  rolled  Ti-6A1-4V  samples. 

Preform  microstructure/rolling  temperature/total  reduction  (%)/heat 
treatment  temperature  were  as  follows: 

(a)  BACC/970°C/77/955  °C,  (b)  BACC/925°C/77/955°C,  (c) 
BACC/815°C/77/955°C,  (d)  BAWQ/970°C/77/955°C,  (e) 
BAWQ/925  °C/77/955  °C 
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(0002)  alpha-phase  pole  figures  for  hot  rolled  Ti-6A1-4V  samples. 
Preform  microstructure/rolling  temperature/total  reduction  (in 
percent)/heat  treatment  temperature  were  as  follows:  (a) 
BACC/970°C/77/970°C,  (b)  BACC/925°C/77/925°C,  (c) 
BACC/815°C/77/815°C,  (d)  BAWQ/970°C/77/970°C,  (e) 
BAWQ/925  °C/77/925  °C. 


Figure  11.  Initial  (mid  plane)  alpha-phase  microtextures  for  Ti-6A1-4V  rolling 
preforms:  (a,b)  BACC  condition,  and  (c,d)  BAWQ  condition.  (a,c)  are 
icosahedral  plots  and  (b,d)  are  total  Eulerian  plots. 
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Figure  12.  Midplane  alpha-phase  microtextures  for  Ti-6A1-4V  BACC  samples  rolled  to 
77  percent  reduction  in  thickness  at  (a)  815°C  or  (b)  925°C.  Both  plots  are 
total  Eulerian  representations. 
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(a)  (b) 


Figure  13.  Microtextures  of  a  BACC  sample  that  was  rolled  to  a  44  percent  reduction 
in  thickness  at  925  3C:  (a)  as  hot  rolled,  or  (b)  rolled  and  heat  treated  at 
925  °C. 


Figure  14.  Total  Eulerian  representation  of  the  midplane  alpha-phase  microtexture  for 
a  Ti-6A1-4V  BAWQ  sample  rolled  to  77  percent  reduction  in  thickness  at 
970°C. 
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MULTI-SITE  AND  WIDESPREAD  FATIGUE  DAMAGE  IN  AIRCRAFT  STRUCTURE  IN 

THE  PRESENCE  OF  PRIOR  CORROSION 


J.D.  Baldwin 
Assistant  Professor 

School  of  Aerospace  &  Mechanical  Engineering 
University  of  Oklahoma 

Abstract 

This  study  explores  the  impact  of  corrosion-induced  pillowing  on  stress  intensity  factors  for 
comer  cracks  in  countersunk  rivet  holes.  Such  deformation  patterns  have  been  observed  in 
aircraft  fuselage  lap  joints.  The  boundary  element  method  was  used  to  compute  the  stresses  and 
stress  intensity  factors  for  a  section  of  lap  joint  loaded  by  faying  surface  pressure  as  well  as  the 
fuselage  pressurization  hoop  stress.  Circular-front  comer  cracks  were  constructed  in  the 
countersink  varying  in  length  from  0.01  to  0.08  inches  at  angular  orientations  varying  from  10 
degrees  to  80  degrees  from  the  hoop  stress  direction.  The  stress  analysis  showed  that  a  distinct 
compressive  region  is  produced  in  the  countersink  by  the  pillowing  pressure  and  that  this 
compression  is  not  cancelled  out  by  the  hoop  stress.  Such  a  compressive  region  is  felt  to  be 
sufficient  to  cause  any  cracks  that  form  in  the  countersink  to  be  driven  toward  the  joint  faying 
surface. 
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MULTI-SUE  AND  WIDESPREAD  FATIGUE  DAMAGE  IN  AIRCRAFT  STRUCTURE  IN 

THE  PRESENCE  OF  PRIOR  CORROSION 


J.D.  Baldwin 


Introduction  and  Background 

As  a  result  of  military  budget  constraints  and  the  quest  for  greater  commercial 
competitiveness,  aircraft  are  now  being  utilized  far  beyond  their  original  designed  operational 
life  spans.  The  United  States  Air  Force  operates  over  600  Boeing  C/KC-135  aircraft  that  have 
been  in  service  35  years  or  more.  These  aircraft  were  designed  in  the  1950’s  and  are  cousins  to 
the  civilian  Boeing  707.  Because  these  aircraft  are  crucial  elements  in  the  force  projection 
strategy  of  the  USAF,  it  has  been  proposed  that  they  should  remain  in  service  well  past  the  year 
2017.' 

Corrosion  was  an  unforeseen  factor  not  included  in  the  initial  service  life  extension 
studies.  However,  it  is  pervasive,  unpredictable  and  often  undetectable.  Corrosion  damage 
repair  costs  the  USAF  more  money  than  any  other  structurally  related  item.  A  recent  study 
funded  by  Warner  Robins  Air  Logistics  Center  conservatively  estimated  that  corrosion  alone 
costs  the  USAF  $700  million  annually.2  Since  the  costs  of  replacing  this  fleet  of  large  jet  aircraft 
are  beyond  the  scope  of  current  budgets,  Boeing  and  the  USAF  have  been  forced  to  confirm  the 

feasibility  of  extending  their  operational  lives. 

Figure  1  schematically  illustrates  the  criteria  for  determining  inspection  schedules  and  the 
potential  effects  of  corrosion  damage  on  these  schedules.  In  the  damage  tolerance  approach  used 
by  the  USAF,  a  crack  of  length  a0  is  assumed  to  exist  in  the  structure  at  the  time  of  its 
manufacture.  This  length  is  generally  considered  to  be  the  maximum  length  that  is  undetectable 
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by  current  inspection  procedures.  The  crack  is  further  assumed  to  exist  at  the  worst-case  location 
and  orientation.  The  non-corroded  a  versus  N  curve  is  then  used  to  predict  the  number  of  loading 
cycles,  Ni,  necessary  for  the  crack  to  grow  from  its  initial  length,  ao,  to  the  critical  length  at 
which  structural  failure  would  occur,  ac.  Inspections  would  then  be  scheduled  at  half  this 
number  of  cycles,  Vz  N/,  to  identify  a  crack  of  length  <2,,  triggering  repair  or  replacement  of  the 
damaged  structure.  Inspections  are  scheduled  at  half  of  the  critical  number  of  cycles  in  order  to 
afford  two  inspections  before  the  crack  reaches  critical  length.  The  effects  of  corrosion,  though, 
could  alter  these  predictions  in  two  ways.  Material  damage  and  changes  in  the  stress  state  may 
tend  to  decrease  the  critical  crack  length  and  accelerate  the  rate  of  crack  growth.  These  effects 
are  seen  as  a  change  in  ac  and  a  steeper  a  versus  N  curve  in  Figure  1.  As  a  result,  the  number  of 
cycles  to  reach  the  critical  crack  length  is  reduced  to  N2.  If  corrosion  is  neglected  and  the 
inspection  schedule  is  not  changed  to  Vi  N2,  the  likelihood  of  detecting  a  crack  before  it  reaches 
its  critical  length  is  reduced. 
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Number  of  Cycles,  N 

Figure  1:  Crack  length  vs.  number  of  fatigue  cycles  indicating  critical  points  and  the  effects  of  corrosion  on 

crack  growth. 

Initial  service  life  extension  estimates  for  the  C/KC-135  fleet  have  been  based  on  fatigue 
damage  accumulation  alone,  and  predict  that  the  fleet  will  be  viable  until  at  least  the  year  2020. 
However,  as  a  result  of  decades  of  exposure  to  both  natural  elements  and  pollution,  corrosion  has 
become  an  increasingly  important  factor  in  the  deterioration  of  airframe  structures.  This  may 
lead  to  a  reduction  in  the  inspection  intervals  needed  to  detect  damage  and  maintain  a  safe  fleet, 
as  illustrated  in  Figure  1.  Quantification  of  the  impact  of  corrosion  on  crack  growth  and  residual 
strength  is  still  a  topic  of  great  interest.  The  experimentally  determined  material  response  data 
do  not  yet  exist  to  form  hard  conclusions  regarding  the  modification  of  inspection  intervals. 

For  the  damage  tolerance  philosophy  to  be  truly  successful,  it  is  important  to  determine 
the  level  of  corrosion  damage  at  which  fatigue  life  is  seriously  affected.  The  damaging  effects  of 
corrosion  manifest  themselves  in  a  variety  of  ways.  These  include  surface  pitting,  exfoliation, 
stress  corrosion  cracking,  and  pillowing.  Isolated  pitting,  in  and  of  itself,  has  little  effect  on  the 
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strength  of  a  structure.  It  can,  though,  result  in  a  multitude  of  crack  initiation  sites  that  could  lead 
to  fatigue  damage.  More  widespread  pitting  can  result  in  an  effective  reduction  of  material 
thickness,  limiting  the  load  carrying  capacity  of  the  structure.  Exfoliation  is  caused  by  cracks 
propagating  parallel  to  material  surfaces  resulting  in  a  significant  reduction  in  the  effective 
material  thickness.  Stress  corrosion  cracking  is  most  often  an  intergranular  separation,  the 
mechanisms  of  which  are  still  under  debate.  Pillowing  is  a  corrosion  phenomenon  that  is 
particularly  common  in  fuselage  lap  joints.  As  the  faying  surfaces  between  the  lap  joint  layers 
corrode,  the  trapped  corrosion  products  build  up  leading  to  out-of-plane  deformations  (Figure  2). 
The  USAF  aircraft  maintenance  community  is  keenly  interested  in  the  effects  of  pillowing  on 
fatigue  cracking. 


Figure  2:  D-Sight  image  of  a  simple  shear  lap  joint  Corrosion  pillowing  is  visible  to  the  right3 


Sites  of  particular  interest  are  countersunk  rivet  holes  in  pillowed  fuselage  joints.  The 
processes  of  machining  rivet  holes  and  inserting  fasteners  often  damage  aircraft  skins  sufficiently 
to  create  possible  crack  initiation  sites.  The  combination  of  these  material  imperfections  with  the 
stress  concentrations  inherent  to  holes  results  in  ideal  conditions  for  the  initiation  and 
propagation  of  comer  cracks.  A  comer  crack  is  a  specific  form  of  surface  crack  characterized  by 
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the  intersection  of  the  crack  with  both  surfaces  that  intersect  to  form  a  comer.  Surface  cracks  do 
not  extend  entirely  through  the  material  thickness,  which  distinguishes  them  from  through 
cracks.  One  troubling  phenomenon  that  has  been  observed  in  pillowed  lap  joints  on  operational 
aircraft  is  the  tendency  for  surface  cracks  to  initiate  on  the  faying  surface,  possibly  as  comer 
cracks,  and  propagate  into  high  aspect  ratio  elliptical  cracks  (Figure  3)  without  breaking  through 
the  top  surface.  This  results  in  very  large  cracks  that  can  not  be  detected  by  visual  inspection. 
Another  characteristic  of  these  cracks  is  their  orientation  away  from  normal  to  the  hoop  stress 
(Figure  4). 


Figure  3:  Cross-section  of  an  upper  lap  joint  layer  showing  high  aspect 


ratio  crack  along  the  faying  surface. 3 


Figure  4:  Faying  surface  cracks  not  aligned  with  the  normal  to  the  hoop  stress  (cracks  indicated  by  arrows). 
The  crack  in  Figure  3  appears  at  the  lower  right  side  of  the  rivet  hole. 
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Previous  Research 


Recent  efforts  have  been  made  to  further  our  understanding  of  the  stress  states  and  altered 
material  properties  that  result  from  corrosion  induced  pillowing  deformation.  In  the  past,  stress 
and  fatigue  analysts  used  a  reduction  in  the  nominal  material  thickness  to  account  for  the  effects 
of  corrosion.  This,  however,  severely  underestimates  the  damage  caused  by  corrosion  by  failing 
to  take  into  account  the  significantly  altered  stress  state  that  results  from  pillowing.  Studies 
investigating  the  stresses  created  by  corrosion  and  the  resultant  pillowing  have  been  conducted 
by  Komorowski, 4  et  al.,  at  the  National  Research  Council  -  Canada.  In  their  finite  element 
studies,  a  uniform  pressure  applied  to  the  faying  surface  was  used  to  simulate  the  forces  resulting 
from  a  buildup  of  corrosion  products.  Their  results  have  indicated  that  if  faying  surface 
corrosion  exists,  the  stress  increase  due  to  thickness  reduction  is  negligible  when  compared  to  the 
stresses  induced  by  pillowing.  This  holds  true  for  thickness  reduction  and  pillowing  compared 
separately  and  in  combination.  Therefore,  modeling  pillowing  alone  provides  the  best 
combination  of  realism  and  simplicity.  Another  result  of  the  research  by  Komorowski,  et  al.,  was 
a  correlation  between  the  level  of  corrosion,  measured  as  a  percent  thickness  loss,  and  the 
maximum  out-of-plane  displacement  caused  by  a  buildup  of  corrosion  products.  The 
investigators  were  attempting  to  obtain  calibration  data  to  be  applied  to  D-Sight  (Diffracto  Ltd., 
Windsor,  Ontario,  Canada)  optical  nondestructive  inspection  (NDI)  techniques.  D-Sight  is  a 
surface  inspection  technique  that  uses  a  light  source  and  retro-reflective  screen.  This  could  allow 
the  quick  and  accurate  determination  of  the  extent  of  corrosion  in  lap  joints  while  avoiding  many 
of  the  problems  associated  with  other  NDI  methods.5  Their  calculations  were  based  on  the 
observation  that  typical  corrosion  products  in  lap  joints  consist  mainly  of  aluminum  oxide 
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trihydrate  and  a  proportionally  small  amount  of  aluminum  oxide  monohydrate.  A  volume  ratio 
for  aluminum  oxide  trihydrate  to  pure  aluminum  of  6.5  to  1  was  used  to  calculate  the  volume 
required  to  accommodate  the  products  generated  by  corrosion  of  a  given  percentage  of  material 
thickness.  Using  a  uniform  pressure  on  the  lower  surface  of  the  outer  skin,  the  deflections 
necessary  to  contain  the  corrosion  products  were  calculated  using  the  closed-form  classical  plate 
theory.6  Only  the  upper  skin  was  modeled  due  to  the  assumed  symmetry  of  the  deformed  joint. 


Figure  5:  Geometry  of  a  round  corner  crack  in  a  straight  bore  hole. 


Because  the  cracks  in  the  countersunk  hole  may  begin  as  comer  cracks,  they  can  be 
thought  of  as  extensions  to  the  case  of  round  comer  cracks  in  a  straight  bore  hole.  The  geometric 
definition  of  this  crack  case  is  shown  in  Figure  5.  The  first  detailed  examination  of  the  stress 
intensity  factors  in  comer  cracks  in  straight  bore  holes  was  given  by  Smith  and  Kullgren  in  1977 
by  means  of  the  finite  element  alternating  method.  In  1979,  Raju  and  Newman  used  a  three- 
dimensional  finite  element  method  to  compute  stress  intensity  factors  for  two  symmetric  comer 
cracks  on  opposite  sides  of  a  straight  bore  hole.  Analysis  of  these  data  was  subsequently 
presented  in  the  form  of  curve  fits9  that  permitted  interpolation  between  the  analysis  cases. 
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Nishioka  and  Atluri10  solved  the  same  symmetric  crack  cases  considered  by  Raju  and  Newman8 
using  the  finite  element  alternating  method  and  found  good  agreement  with  the  prior  results 
except  at  the  bore  of  the  hole  where  the  two  solutions  diverged.  The  source  of  the  error  in  the 
earlier  finite  element  solutions  was  subsequently  found  to  be  poorly  shaped  elements  at  the  hole 
surface.11  Revised  finite  element  solutions  were  given  by  Shivakumar  and  Newman1-  in  1991. 
In  1990,  Zhao  and  Wu13  presented  a  weight  function  solution  for  the  comer  crack  problem  by 
combining  a  series  of  two-dimensional  solutions  to  form  the  three-dimensional  stress  intensity 
data.  It  is  important  to  note  that  all  of  the  prior  solutions  referenced  above  have  been  for  straight 
bore  holes  (i.e.,  with  no  countersink  feature).  The  only  known  set  of  solutions  for  stress  intensity 
factors  in  countersunk  holes  is  due  to  Tan,  et  al.14.  However,  these  studies  have  focused 
primarily  on  uniaxial  tension  and  the  pure  fatigue  crack  growth  resulting  from  variable  loading. 
The  unique  cracking  conditions  that  exist  near  rivet  holes  in  corroded  lap  joints  have  been 
neglected.  It  was  the  goal  of  the  research  presented  here  to  make  a  first  attempt  at  describing  the 
effects  of  pillowing  on  the  properties  of  elliptical  cracks  in  countersunk  rivet  holes. 

Objectives 

Unlike  the  simple  two-dimensional  through  crack,  real  cracks  exist  in  three  dimensions, 
are  often  elliptical  in  shape,  and  do  not  extend  entirely  through  the  body.  Under  these  conditions, 
the  evaluation  of  stress  intensity  becomes  a  more  complex  matter,  but  it  is  still  a  function  of  the 
stress  field  and  crack  geometry.  Various  numerical  techniques  based  on  crack  face 
displacements  are  employed  to  facilitate  these  calculations,15  but  will  not  be  described  in  detail 
as  they  are  not  essential  to  the  concept  of  stress  intensity  factors.  One  goal  of  this  study  was  to 
provide  a  preliminary  analytical  database  of  Mode  I  stress  intensity  factors  for  cracks  emanating 
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from  countersunk  rivet  holes  at  different  angular  orientations,  with  the  focus  on  the  geometry  and 
conditions  found  in  C/KC-135  fuselages.  The  second  objective  was  to  model  three-dimensional 
crack  propagation  into  the  pillowed  regions  and  to  identify  the  differences  between  these  cracks 
and  the  planar  cracks  used  in  the  preliminary  investigation  of  stress  intensity  factors.  The 
remainder  of  this  report  will  discuss  the  tools  and  modeling  techniques  used  to  achieve  these 
goals,  as  well  as  the  results  of  the  study. 

Methods  of  Analysis 

The  tool  chosen  for  the  calculation  of  stress  intensity  factors  was  FRANC3D,  a  boundary 
element  program  developed  by  the  Cornell  University  Fracture  Group  lead  by  Dr.  Anthony 
Ingraffea.16  Unlike  finite  element  methods  (FEM)  which  require  the  discretization  of  the  body 
volume,  boundary  element  methods  (BEM)  require  only  the  discretization  of  the  body  surface. 
This  inherently  leads  to  fewer  elements  being  needed  to  accurately  represent  a  body,  especially 
for  three-dimensional  problems.  Since  the  complexities  of  countersunk  hole  geometry  with  both 
in-plane  and  out-of-plane  loading  were  such  that  a  three-dimensional  analysis  was  necessary, 
BEM  offered  advantages  where  model  generation  efforts  were  concerned.  Another  benefit  of 
requiring  fewer  elements  to  describe  the  structure  is  a  decrease  in  the  number  of  algebraic 
equations  that  must  be  solved.  In  addition,  BEM  reduces  the  dimensionality  of  any  given 
problem  by  one.  For  instance,  a  three-dimensional  problem  results  in  a  two-dimensional 
boundary  integral  equation.  However,  because  boundary  element  methods  produce  fully 
populated  system  matrices  for  homogeneous  regions,  BEM  cannot  use  the  efficient  sparse  matrix 
solvers  employed  in  the  reduction  of  the  larger,  sparsely  populated  finite  element  matrices.  Thus, 
the  boundary  element  method  requires  a  greater  number  of  computations  to  obtain  each 
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component  of  the  solution  matrix  compared  to  FEM.  This  offsets  to  some  degree  the 
computational  advantages  of  fewer  equations,  lower  dimensionality,  and  decreased  matrix 
reduction.  The  benefits  of  BEM  are  best  realized  when  working  with  models  of  large  physical 
dimensions.17 

The  major  advantage  of  FRANC3D  and  its  supporting  software  was  their  ability  to 
facilitate  solid  modeling  and  discretization  of  complex  geometries.  FRANC3D  uses  a  hierarchy 
of  models  that  combine  to  fully  describe  the  geometry,  material  properties,  and  boundary 
conditions  of  the  structure  to  be  analyzed.  The  basis  of  the  hierarchy  is  the  solid  model.  Unlike 
most  engineering  simulations  in  which  the  structure  and  boundary  conditions  must  be  described 
entirely  by  a  mesh  and  its  attributes,  FRANC3D  uses  the  solid  model  to  specify  geometry, 
material  properties,  and  boundary  conditions.  These  properties  are,  therefore,  part  of  the  basic 
description  of  the  structure  and  independent  of  any  mesh  applied  for  analytical  purposes.  Initial 
uncracked  geometries  are  created  using  the  Object  Solid  Modeler  ( OSM)  software  developed  by 
the  Cornell  Fracture  Group.  These  geometry  files  are  then  interpreted  by  FRANC3D  where 
boundary  conditions  and  material  properties  are  applied  directly  to  geometric  volumes,  faces, 
edges,  and  vertices.  Material  properties  are  assigned  to  the  three-dimensional  continuum  and 
become  a  part  of  the  basic  model.  The  analytical  model,  or  mesh,  is  then  overlaid  on  the  solid 
model  and  inherits  the  properties  assigned  to  the  solid  model.  In  this  way,  different  meshes  can 
be  applied  to  the  model  without  the  user  needing  to  redefine  the  elemental  or  nodal  attributes. 
The  stress  analysis  is  done  by  the  Boundary  Element  System  ( BES)  software.  FRANC3D 
provides  post-processing  functions.  A  significant  limitation  of  FRANC3D  (and  most  BEM 
systems)  is  that  it  is  limited  to  linear,  elastic  analysis,  which  dictated  that  the  stresses  induced  in 
the  model  must  remain  below  the  yield  point  of  the  material.19 
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Cracks  in  FRANC  3D  are  created  in  the  geometry  model  and  are,  therefore,  inherited  by 
the  analytical  model.  The  capability  exists  to  model  internal  cracks,  through  cracks,  and  surface 
cracks,  though  only  surface  cracks  were  considered  in  this  study.  Surface  cracks  have  at  least 
one  intersection  with  a  boundary  surface.  Comer  cracks  like  the  ones  studied  here  intersect  at 
least  two  surfaces  (Figure  5)  and  possibly  more,  depending  on  the  particular  crack  geometry. 
Cracks  are  modeled  as  a  main  face  and  a  mate  face  that  are  topologically  separate  but 
geometrically  identical.  This  also  applies  to  the  edges  and  vertices  that  define  the  crack  faces, 
with  the  exception  of  the  crack  front  edges  and  vertices.  This  allows  the  crack  faces  to  separate 
in  the  analytical  model  without  imposing  an  artificial  finite  gap  in  the  geometry.  Such  a  gap 
could  cause  computational  errors  introduced  by  rounding  and  point  co-location. 

FRANC3D  calculates  stress  intensity  factors  ( Kt ,  Kn,  and  Km)  using  the  crack  opening 
displacement  technique.  Values  for  K,,  crack  opening  mode  stress  intensity  factors,  are 
calculated  at  discrete  points  along  the  crack  front  based  on  the  displacement  of  points  on  the 
crack  faces  some  measured  distance  from  the  crack  front.  Referring  to  Figure  6,  Ki  is  calculated 

based  on  the  formula15: 

K  -  JL  [4(vb'-vd>(vc'-ve')]  0) 

where  G  is  the  shear  modulus,  k  is  (3-v)/(l+v)  in  cases  of  plane  stress  or  (3-4v)  for  plane  strain 
and  axisymmetry,  r  is  Poisson’s  ratio,  v'  is  the  displacement  normal  to  the  crack  axis  at  the 
indexed  point,  and  L  is  the  distance  between  point  A  and  points  C  and  E  in  Figure  6. 
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Figure  6:  Schematic  for  crack  opening  displacement  correlation. 


Analytical  Models 

Typical  fuselage  lap  joints  on  C/KC-135  aircraft  consist  of  two  overlapping  aluminum 
(2024-T3)  skins  of  equal  thickness  (typically  0.063  in.).  These  are  held  together  by  countersunk 
rivets  and  are  attached  to  a  structural  stringer  as  shown  in  Figure  7.  The  faying  surfaces  on 
repaired  aircraft  may  also  be  coated  with  a  sealant  intended  to  prevent  or  delay  corrosion  between 
the  layers.  The  rivets  have  a  typical  shank  diameter  of  0.1875  in.  and  100°  countersunk  heads 
and  are  installed  on  one-inch  centers  both  horizontally  and  vertically.  Evidence  from  operational 
aircraft  has  shown  that  the  upper  row  of  rivets  is  the  critical  location  in  crack  initiation. 
However,  in  the  presence  of  corrosion  pillowing,  observed  cracks  are  rarely  oriented  normal  to 
the  hoop  stresses  caused  by  fuselage  pressurization.  Instead,  they  tend  to  start  in  the  lower  half 
of  the  rivet  holes  and  propagate  into  the  pillowed  regions  between  adjacent  holes  as  shown  in 
Figure  4. 
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OUTER/UPPER  SKIN 


Figure  7:  Schematic  of  C/KC-135  fuselage  lap  joint.5 


Preliminary  Model 

Since  the  critical  cracking  location  is  on  the  upper  skin,  it  alone  was  modeled  using 
boundary  element  methods  (BEM)  in  FRANC3D.  Because  the  sealant  and  adhesive  layers  must 
deteriorate  for  corrosion  to  occur,  neither  was  included  in  this  analysis.  Fuselage  curvature 
effects  were  assumed  negligible  and  were,  therefore,  not  incorporated  into  any  of  the  models. 
The  initial  model  (Figure  8)  was  created  to  determine  the  ability  of  the  FRANC3D  software  to 
duplicate  the  general  overall  stress  states  achieved  by  Bellinger,  et  al.,4  under  similar  loading 
conditions.  This  model  consisted  of  two  rows  and  three  columns  of  rivet  holes  simulating  two 
pillowed  areas  and  their  immediate  surroundings.  Hoop  stresses  were  modeled  by  a  12  ksi 
tensile  traction  along  the  left  surface.  The  straight  shanks  of  the  rivet  holes  were  constrained  to 
have  no  displacements  normal  to  their  surfaces,  simulating  the  presence  of  rivets.  In  addition, 
global  z-axis  translations  were  constrained  in  the  countersunk  regions  of  the  holes  to  simulate  the 
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clamping  effects  of  rivet  head  loads.  A  uniform  pressure  was  applied  to  the  faying  surface  to 
achieve  out-of-plane  pillowing  displacements,  and  symmetry  conditions  were  imposed  on  the  top 
and  bottom  edge  surfaces. 


SYMMETRY  CONDITIONS 


NO  DISPLACEMENT 
NORMAL  TO  A__ 
straight  $-a\<s 


miuitnimm 

X  085  ksi 

PILLGVING  load 


Figure  8:  Initial  model  and  boundary  conditions  used  for  a  qualitative  evaluation  of  FRANC3D. 


The  out-of-plane  displacement  contours  calculated  by  FRANC3D  (Figure  9)  agreed 
qualitatively  with  those  obtained  by  Komorowski,  et  al.,  in  their  elastic-plastic  finite  element 
analysis.5  Displacements  and  stresses  could  not  be  directly  compared  due  to  the  limitations 
imposed  by  the  linear  elastic  formulation  of  FRANC3D.  Unfortunately,  the  accompanying  scale 
by  which  to  interpret  the  contour  mapping  is  not  included  in  FRANC3D  output  files.  The  light 
areas  surrounding  the  rivet  holes  correspond  to  the  regions  of  least  displacement,  which  goes  to 
zero  on  the  countersinks.  The  darkest  areas  in  the  center  of  the  pillowed  regions  correspond  to 
the  regions  of  maximum  z-displacement  (w=0.0048  inch). 
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Figure  9:  Out-of-plane  displacement  (w)  contours  in  initial  model. 


Model  for  Crack  Analysis 

Once  the  analytical  capabilities  of  BES  were  qualitatively  confirmed,  a  second,  smaller 
model  was  constructed  for  the  fracture  mechanics  analyses  (Figure  10).  The  second  model  was 
used  to  reduce  computational  time  while  duplicating  the  stress  state  established  by  the  first 
model.  This  model  consisted  of  only  one  pillowed  area  representing  the  region  between  the 
upper  and  middle  rows  of  rivets.  Boundary  conditions  on  the  rivet  hole  shanks  and  countersunk 
regions  were  identical  to  those  used  in  the  larger  model.  Similarly,  a  12  ksi  traction  was  applied 
along  the  left  surface.  Symmetry  conditions  were  again  applied  to  the  top  and  bottom  edge 
surfaces  with  an  additional  symmetry  condition  imposed  on  the  right  surface.  Since  FRANC3D 
is  only  capable  of  linear  elastic  analysis,  stresses  in  the  model  had  to  be  limited  to  the  yield 
strength  of  2024-T3,  which  is  approximately  48  ksi.18  Pressure  was  applied  to  the  faying  surface 
such  that  this  criterion  was  not  exceeded.  By  trial  and  error,  it  was  found  that  a  uniform  pressure 
of  0.85  ksi  on  the  faying  surface,  when  combined  with  the  other  boundary  conditions,  induced  a 
maximum  principal  stress  of  48  ksi  in  the  uncracked  model  (von  Mises  stresses  are  not  reported 
by  FRANC3D).  The  resulting  deflection  at  the  center  of  the  pillowed  region  was  0.0048  inch. 


Based  on  the  relationship  between  central  deflection  and  percent  thickness  lost  to  corrosion 
formulated  by  Bellinger,5  et  al.,  this  deflection  corresponds  approximately  to  a  2.5%  thickness 
reduction.  It  is  clear  from  this  result  that  a  very  small  amount  of  corrosion  is  sufficient  to  stress 
the  lap  joint  material  beyond  its  yield  point. 


SYMMETRY  CONDITION 


Figure  10:  Solid  model  with  boundary  conditions  used  for  crack  analysis. 

The  z-axis  displacements  (w)  resulting  from  the  loading  and  boundary  conditions 
described  in  Figure  10  are  shown  in  Figure  11.  Again,  the  lightest  regions  surrounding  the  rivet 
holes  correspond  to  zero  out-of-plane  displacement,  while  the  center  of  the  pillowed  region  was 
displaced  0.0048  inch.  Under  the  same  conditions,  the  overall  stress  field  in  the  model  is 
illustrated  in  Figure  12.  As  indicated  in  the  overall  picture  and  the  close-up  of  the  stress  field 
surrounding  the  lower  left  rivet  hole  (Figure  13),  the  majority  of  the  structure  is  in  a  state  of 
tensile  stress.  However,  a  region  of  compressive  stress,  induced  by  the  pillowing  deformation, 
exists  in  the  material  near  the  top  of  the  countersink  surface. 
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Figure  11:  Out-of-plane  displacement  ( w )  contours 
on  top  surface  of  crack  analysis  model. 


Figure  12:  Principal  stress  field  (o>)  on  top 
surface  of  crack  analysis  model. 


Figure  13:  Principal  stress  field  (ct/)  on  the  surface  of  the  lower  left  rivet  hole  in  Figure  12. 


Analytical  Crack  Characteristics 


All  of  the  cracks  analyzed  in  the  countersunk  region  of  the  model  were  planar,  circular- 
front,  comer  cracks  radiating  from  the  rivet  hole  at  the  faying  surface  as  shown  in  Figure  14. 
Crack  front  shapes  were  based  on  the  Findings  of  Willard19  illustrated  in  Figure  15  for  cracks 


near  countersunk  holes  in  specimens  subjected  to  uniaxial  tension.  These  afforded  the  closest 
available  approximation  since  little  research  has  been  done  on  crack  front  shapes  under  the 
conditions  specific  to  this  study.  Crack  lengths,  a,  were  measured  by  the  distance  from  the 
comer  of  the  hole  to  any  point  on  the  round  crack  face  and  varied  between  0.01  inch  and  0.08 
inch  inclusive.  These  were  non-dimensionalized  with  respect  to  the  material  thickness,  a  =  alt , 


to  give  a  range  of  a=  0.159-1.27  ( t  =  0.063  inch).  Cracks  were  oriented  every  10  degrees 


between  the  model  edges,  resulting  in  eight  angular  orientations  between  10°  and  80°  inclusive, 


as  shown  in  Figure  16.  Since  the  y-axis  is  normal  to  the  hoop  stress,  it  was  used  as  the  reference 


for  crack  orientation.  Thus,  a  10°  crack  is  oriented  10°  from  the  normal  to  the  hoop  stress. 


Figure  14:  Crack  lengths,  a,  measured  from  the  corner  along  the  faying  surface. 
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Figure  15:  Micrograph  montage  showing  corner  crack  growth  near  a  countersunk  hole.  Marker  bands 
showing  crack  front  locations  at  constant  intervals  are  highlighted  and  numbered. 


Figure  16:  Schematic  of  crack  orientations,  fi. 

Although  boundary  element  methods  are  relatively  insensitive  to  element  shape  and 
aspect  ratio  on  most  surfaces,  both  factors  were  taken  into  account  on  the  crack  faces  near  the 
crack  front.  The  creators  of  FRANC3D  recommend  that  the  elements  along  the  crack  front  be 
nearly  square  to  assure  accurate  stress  recovery  in  the  stress  intensity  factor  calculations.  This 
was  easily  achieved  through  bilinear  mapping  on  the  crack  faces  near  the  front.  A  convergence 


study  was  conducted  to  determine  the  minimum  number  of  elements  required  along  the  crack 
front.  A  crack  of  length  a  =  0.01  inch  oriented  at  /?=  10°  was  modeled  using  15,  20,  and  25 
elements  along  the  crack  front.  The  resulting  values  of  Kj  are  shown  in  Figure  17.  The  stress 
intensity  factors  calculated  from  20  and  25  crack  front  elements  were  nearly  identical,  while 
those  calculated  from  15  elements  showed  minor  irregularities.  In  order  to  obtain  adequate 
resolution  with  as  few  elements  as  possible,  all  subsequent  cracks  were  modeled  with  20  nearly 
square  elements  along  their  crack  fronts  as  shown  in  Figure  18. 
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Figure  17:  Comparison  of  distributions  calculated  with  15,  20,  and  25  crack  front  elements. 
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Figure  18:  Typical  crack  face  mesh  with  nearly  square  elements  along  the  crack  front. 

In  addition  to  the  survey  of  stress  intensity  factors  for  uniform  planar  cracks  in  the 
countersink,  cracks  in  the  pillowed  region  were  also  considered.  Planar  through  cracks  were 
nucleated  from  the  rivet  hole  at  orientations  of  10°,  30°,  60°,  and  80°  as  defined  above,  and 
extended  0.05  inch  beyond  the  countersink.  Subsequent  cracks  were  then  formed  using  two 
different  methods.  The  first  was  uniform  planar  extension,  resulting  in  a  change  of  length  only. 
This  allowed  for  an  analysis  similar  to  that  done  in  the  countersink.  The  second  method  was 
through  the  use  of  an  automated  calculation  in  FRANC3D  based  on  a  logarithmic  formulation 
similar  to  Paris’s  Law.16  The  equation  used  by  FRANC3D  to  calculate  the  extension  was: 


where  5max  is  the  user-specified  maximum  extension  of  the  crack  tip  and  b  is  a  parameter 
obtained  from  the  relationship  between  stress  intensity  factors  and  the  growth  of  through  cracks 
in  two-dimensional  structures  under  plane  strain.  The  terms  K[  and  Ki  max  are  the  stress  intensity 
factor  at  a  specific  point  and  the  maximum  stress  intensity  factor  on  the  crack  front,  respectively. 
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The  default  value  of  b  =  1  was  left  unchanged  because  insufficient  information  is  available  as  to 
the  growth  rates  of  these  cracks  to  make  a  more  accurate  determination.  The  method  used  to 
determine  the  direction  of  crack  propagation  must  be  chosen  by  the  user  from  the  following 
theories:  the  maximum  tangential  stress  theory,  the  maximum  strain  energy  release  rate,  and  the 
minimum  strain  energy  density.16  Again,  due  to  limited  information  about  these  cracks,  the 
default  selection  of  the  maximum  tangential  stress  theory  was  used. 

Results 

Stress  intensity  factors  were  calculated  for  cracks  throughout  the  countersunk  region 
surrounding  the  rivet  hole.  These  were  grouped  for  analysis  by  both  angular  orientation  and 
crack  length.  For  each  angular  orientation,  the  stress  intensity  factors  for  all  cracks  analyzed  at 
that  orientation  were  plotted  versus  the  normalized  crack  front  distance  (Figures  19  through  26). 
The  normalized  crack  front  distances  are  arc  lengths  along  the  crack  front  measured  from  the 
faying  surface  and  normalized  with  respect  to  the  total  crack  front  length.  These  plots  were  used 
to  visualize  the  way  in  which  the  stress  intensity  varied  with  crack  length.  Similarly,  three- 
dimensional  plots  were  created  to  compare  how  stress  intensities  varied  with  angular  orientation 
for  cracks  of  a  specific  length  (Figures  27  through  34). 

Analysis  of  the  variation  of  Kt  with  crack  length  revealed  several  interesting  trends.  First, 
it  was  observed  that  in  all  cases,  the  stress  intensity  factor  at  the  faying  surface,  corresponding  to 
a  normalized  position  of  0.0,  increased  with  crack  length.  This  was  to  be  expected  from  basic 
fracture  mechanics  since  stress  intensity  factors  are  proportional  to  crack  length  in  a  given  stress 
state.  Also  of  interest  is  the  manner  in  which  the  stress  intensities  decrease  rapidly  in  the  upper 
portions  of  the  crack  front,  i.e.  near  the  countersink  surface,  for  many  of  the  orientations.  This 
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does  not  occur  in  the  10°  cracks,  which  are  most  nearly  normal  to  the  direction  of  the  hoop  stress, 
as  the  curve  maintains  a  characteristic  shape  for  all  crack  lengths.  At  this  orientation  (/?=  10  ), 
the  tensile  hoop  stress  is  sufficient  to  overcome  the  compressive  loads  created  by  the  pillowing 
displacement  (Figure  19).  However,  as  the  crack  orientations  become  more  nearly  parallel  to  the 
hoop  stresses,  the  drop  in  stress  intensities  near  the  upper  surface  of  the  countersink  becomes 
increasingly  pronounced.  This  is  consistent  with  the  compressive  stress  field  near  the  upper 
surface  surrounding  the  rivet  hole  as  shown  in  Figure  13.  For  cracks  oriented  at  30°  or  greater, 
the  stress  intensities  near  the  top  of  the  crack  front  go  to  zero  for  the  larger  cracks.  In  fact,  the 
numerical  analysis  of  these  cracks  resulted  in  negative  values  for  stress  intensity  factors,  which 
are  physically  impossible.  This  was  simply  the  result  of  the  analytical  crack  faces  being 
compressed  through  each  other  resulting  in  negative  crack  opening  displacements.  Under 
conditions  of  compressive  stress  that  would  cause  these  negative  displacements,  K,  has  no 
meaning.  Therefore,  any  negative  stress  intensity  values  were  set  to  zero.  Although  Figure  13 
reveals  only  the  surface  stresses,  the  extent  to  which  the  compressive  stress  field  extends  below 
the  surface  is  illustrated  by  the  crack  front  stress  intensity  factors  being  zero. 


Figure  20:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  cracks  oriented  at  20°. 
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Figure  21:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  cracks  oriented  at  30  . 


Figure  22:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  cracks  oriented  at  40  . 
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Examination  of  the  three-dimensional  plots  of  Ki  versus  crack  front  location  and  crack 
orientation  revealed  that  the  stress  intensity  factors  were  relatively  uniform  in  smaller  cracks  ( a- 
0.16)  over  the  range  of  angles  (Figure  27).  These  were  comer  cracks  confined  to  the  straight 
shank  portion  of  the  rivet  hole,  not  yet  having  broken  through  the  countersunk  surface.  The  first 
cracks  that  intersected  the  countersink  ( a-  0.32)  began  to  display  larger  Ki  variations  with 
respect  to  orientation.  This  can  first  be  seen  in  Figure  28.  Stress  intensity  factors  were  nearly 
uniform  across  the  faying  surface  for  all  angles  at  a-  0.32.  The  10°  crack  of  this  length 
maintained  the  same  Kj  distribution  as  seen  in  Figure  27,  although  magnified  due  to  the  increased 
crack  length.  The  qualitatively  similar  Kj  distributions  for  the  10°  crack  were  common 
throughout  the  range  of  crack  lengths,  with  the  overall  level  of  stress  intensity  increasing  as  a 
increased.  Other  crack  orientations,  particularly  in  the  range  from  30°  to  80°,  displayed  a  marked 
decrease  in  K/  toward  the  countersunk  surface.  Beginning  with  cracks  of  length  a=  0.79,  those 
with  orientation  angles  of  more  than  60°  developed  regions  of  zero  stress  intensities  near  the 
countersink.  As  the  crack  length  is  increased,  the  region  where  Kj  is  equal  to  zero  was 
encountered  by  cracks  of  just  over  20°  (Figure  34). 
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Figure  27:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=0.16. 


Figure  28:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a-0.32. 
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Figure  29:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=0.48. 


Figure  30:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=0.63. 
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Figure  31:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=0.79. 


Figure  32:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=0.95. 
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Figure  33:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=l.ll. 


Figure  34:  Stress  intensity  factors  vs.  normalized  crack  front  distance  for  all  cracks  a=1.27. 
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A  comparison  of  stress  intensity  factors  at  the  faying  surface  for  the  smallest  cracks 
analyzed  ( a=  0.16),  reveals  that  the  10°  crack  had  the  lowest  value  of  Kh  This  suggests  that  if 
material  imperfections  were  present  all  along  the  rivet  hole  edge,  those  oriented  away  from  the 
normal  to  the  hoop  stress  would  be  more  likely  to  nucleate  cracks.  This  condition  would  explain 
the  absence  of  cracks  radiating  toward  the  next  adjacent  rivet  hole,  and  instead  propagating  into 
the  pillowed  region.  Corrosion  pillowing  alters  the  stress  state  to  such  an  extent  that  the  hoop 
stress  is  no  longer  the  driving  force  in  crack  initiation.  However,  it  should  be  noted  that  the 
stress  intensity  factors  for  cracks  at  10°  exceed  those  for  cracks  at  any  other  orientation  for  crack 
lengths  beyond  a=  0.952.  This  indicates  that  pre-existing  cracks  of  this  length  or  greater  at  an 
orientation  of  10°  would  tend  to  propagate  faster  than  those  at  other  orientations.  It  is  important 
to  remember  that  this  study  was  limited  to  a  linear  elastic  analysis.  This  dictated  the  maximum 
allowable  stresses,  and  consequently  the  maximum  pillowing  deflection.  An  elastic-plastic 
analysis  should  be  conducted  to  determine  the  effects  of  increased  corrosion  on  crack  nucleation 
sites. 

The  data  also  offer  some  explanation  for  the  high  aspect  ratio  cracks  on  lap  joint  faying 
surfaces.  High  stress  intensity  factors  at  the  faying  surface  indicate  that  cracks  will  propagate 
readily  in  that  region.  However,  as  the  stress  intensities  tend  to  decrease  and  eventually  go  to 
zero  in  the  compressive  stress  field  near  the  countersink  surface,  crack  growth  would  be  arrested 
before  becoming  through  cracks.  The  locus  of  points  where  stress  intensity  factors  go  to  zero 
defines  a  neutral  surface  separating  tensile  stress  near  the  faying  surface  and  compressive  stress 
near  the  countersink  or  outer  surface.  The  maximum  height  to  which  a  faying  surface  crack 
could  grow  would  be  limited  to  some  level  below  this  neutral  surface,  since  even  positive  stress 
intensities  are  not  sufficient  for  crack  propagation.  At  some  point  below  the  neutral  surface, 
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stress  intensities  could  reach  the  minimum  or  critical  stress  intensity  factor  at  which  crack  growth 
would  occur.  Again,  because  of  the  elastic  limitations  of  FRANC3D,  the  critical  stress  intensity 
factor  for  2024-T3  was  never  attained.  This  was  because  the  maximum  principal  stress  was 
located  outside  the  region  of  crack  investigation.  An  elastic-plastic  analysis  would  allow  for 
higher  levels  of  pillowing  that  could  cause  the  stress  intensity  at  some  point  to  reach  Klc. 

Attempts  to  model  the  propagation  of  planar  through  cracks  beyond  the  countersink  were 
unsuccessful.  Portions  of  the  crack  front  remained  in  regions  of  compressive  stress  beyond  the 
countersink  diameter,  which  can  be  seen  in  Figure  13.  This  resulted  in  the  calculation  of 
negative  values  for  K\  along  the  upper  segment  of  the  crack  fronts.  Since  negative  stress 
intensities  are  physically  impossible  and  would  result  in  the  extension  of  only  the  lower  segments 
of  the  cracks,  FRANC3D  was  unable  to  propagate  these  cracks.  Modeling  of  through  crack 
propagation  was  attempted  because  it  was  the  natural  extension  of  the  round  comer  cracks 
modeled  in  the  rivet  holes.  However,  given  the  findings  of  this  part  of  the  study  and  the  fact  that 
high  aspect  ratio  surface  cracks  are  more  commonly  found  in  actual  pillowed  lap  joints,  surface 
cracks  initiated  on  the  faying  surface  would  be  the  subject  of  future  studies. 

Concluding  Remarks 

It  was  the  aim  of  this  study  to  determine  the  characteristics  of  cracks  in  countersunk  rivet 
holes  under  the  conditions  imposed  by  corrosion-induced  pillowing  in  aircraft  fuselage  lap  joints. 
It  was  hoped  that  this  investigation  would  lead  to  a  better  understanding  of  cracks  observed  in 
operational  aircraft.  These  include  cracks  that  nucleate  at  orientations  other  than  normal  to  the 
hoop  stress  (Figure  4)  and  high  aspect  ratio  cracks  along  the  faying  surface  (Figure  3).  The 
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evidence  collected  tends  to  offer  explanations  to  both  of  these  phenomena,  and  can  be 
summarized  as  follows: 

1.  Stress  intensity  factors  for  small  cracks  (a=  0.16)  are  greater  at  orientations  other  than 
10°,  offering  more  favorable  crack  nucleation  sites  than  those  oriented  normal  to  the  hoop 
stress. 

2.  A  compressive  stress  field  exists  within  the  material  near  the  countersink  surface  and 
extends  beyond  the  diameter  of  the  countersink  into  the  pillowed  region.  This  prevents 
comer  cracks  from  breaking  through  the  upper  surface  and  promotes  the  high  aspect  ratio 
cracks  found  in  actual  pillowed  lap  joints. 

An  elastic-plastic  analysis  should  be  done  to  investigate  the  effects  of  higher  levels  of 
corrosion  and  the  resulting  increase  in  pillowing  deformation.  Increased  pillowing  would  almost 
certainly  create  greater  tensile  stresses  in  the  material  surrounding  the  rivet  hole.  The  resulting 
increase  in  stress  intensity  may  give  a  clearer  indication  of  why  cracks  tend  to  initiate  where  they 
do.  An  elastic-plastic  analysis  would  also  allow  the  stress  intensity  factors  to  reach  the  critical 
value  over  some  part  of  the  crack  front.  From  this,  the  region  susceptible  to  crack  growth  could 
be  determined  and  the  upper  limit  of  faying  surface  crack  growth  could  be  defined. 

For  crack  propagation  in  the  pillowed  region  to  be  successfully  modeled,  surface  cracks 
should  be  nucleated  along  the  faying  surface  at  various  locations,  not  necessarily  at  the  comer  of 
the  rivet  hole.  These  cracks  could  then  be  propagated  into  the  pillowed  region  or  toward  the  rivet 
hole  as  dictated  by  the  stress  intensities  along  the  crack  fronts.  This  method  of  crack  propagation 
could  be  done  with  a  FRANC3D  elastic  analysis  or  in  subsequent  elastic-plastic  analyses. 

The  stress  state  in  the  material  surrounding  the  rivet  hole  suggests  that  crack  propagation 
in  this  region  is  a  mixed  mode  phenomenon.  Future  studies  should  also  include  the  analysis  of 
Mode  II  and  possibly  Mode  HI  stress  intensities. 
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Abstract 

This  report  summarizes  the  continued  exploration  of  the  use  of  deconvolution  on  the  Doppler-Azimuth 
Spectrum  of  signals  collected  by  an  airborne  radar  array.  Deconvolution  is  applied  to  the  Azimuth-Doppler 
Spectrum  (ADS)  with  the  objective  of  emphasizing  the  information  associated  with  valid  targets,  while  de¬ 
emphasizing  unwanted  signal  components  such  as  irrelevant  background  returns,  or  clutter,  and 
interference  radiated  to  obscure  valid  radar  returns,  or  jammer.  Alternative  forms  of  implementation  for  the 
concept  of  deconvolution  are  explored  that  prove  less  computationally  intensive  than  the  Non-Negative 
Least  Squares  (NNLS)  method,  initially  proposed  for  this  purpose  in  the  preceding  Summer  Faculty 
Research  Program  project.  Van  Cittert’s  method  for  iterative  constrained  deconvolution  was  found  to  be  a 
viable  candidate  method.  Both  the  NNLS  and  Van  Cittert’s  method  have  been  taken  from  their  prototype 
implementations  in  an  interpreted  simulation  environment  (Matlab  ®),  to  portable  ANSI  C 
implementations.  The  advantages  and  disadvantages  observed  for  the  methods  in  a  number  of  spectral 
scenarios  studied  are  summarized. 
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Armando  B.  Barreto 


Introduction 

Modem  airborne  radar  systems  incorporate  a  number  of  radar  receiver  elements  to  enable  them  to  perceive 
the  angle  of  incidence  of  a  returned  signal.  So,  in  addition  to  the  determination  of  the  range  or  distance 
between  the  instrument  and  the  scatterer  and  the  relative  velocity  between  the  two,  detected  by  Doppler 
effect  [5][9][12],  the  data  collected  by  a  multi-element  radar  system  encodes  the  azimuthal  position  of  the 
scatterer[4].  This  added  information  can  be  used  to  further  distinguish  between  wanted  and  unwanted 
components  in  the  signals  received  by  the  radar  array.  For  example,  a  valid  moving  target  will  normally  be 
represented  by  a  return  of  unique  Doppler  shift  (indicative  of  the  relative  velocity  of  the  target  and  the  radar 
instrument),  from  a  unique  azimuth,  with  respect  to  the  broadside  of  the  radar  array.  In  contrast,  a 
broadband  or  barrage  jammer  signal  will  be  perceived  also  as  arriving  from  a  single  azimuth  ( the  azimuth 
of  the  jamming  source  with  respect  to  the  broadside  of  the  array),  but  it  will  be  sensed  at  many  Doppler 
frequencies.  Another  unwanted  component  in  the  signals  received  by  the  airborne  array  is  defined  by  the 
irrelevant  returns  associated  with  the  background  illuminated  by  the  radar,  or  clutter.  Because  there  may 
be  returns  from  background  all  around  the  radar  instrument,  this  form  of  interference  may  be  perceived  as  if 
arriving  to  the  instrument  from  all  azimuths.  Since  the  background  scatterers  are  static,  the  Doppler 
characteristic  of  clutter  returns  varies  as  function  of  azimuth,  which  is  determined  by  the  velocity  of  the 
radar  platform. 

Given  the  potential  differentiation  between  valid  target  returns,  clutter  returns  and  jamming  interference, 
the  signal  processing  challenge  is  to  develop  techniques  capable  of  emphasizing  the  target  data  and 
simultaneously  eliminate  or  attenuate  the  unwanted  signal  components.  Such  processing  scheme  will 
enhance  the  efficiency  and  reliability  of  radar  systems  to  detect  and  track  moving  targets. 
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Discussion  of  the  problem 


The  data  collected  by  the  radar  array  during  a  Coherent  Pulse  Interval  (CPI) ,  which  will  be  the  input  to  the 
radar  processing  algorithm,  is  normally  considered  as  organized  in  a  structure  known  as  the  ”CPI 
Datacube”.  For  a  radar  array  with  N  elements,  which  transmits  M  pulses  per  Coherent  Pulse  Interval,  and 
collects  signals  at  L  different  range  gates,  the  CPI  datacube  is  organized  as  shown  in  Figure  1. 


Figure  1.  The  CPI  Datacube  and  the  target  detection  process 
The  values  associated  with  a  single  range  gate  are  indicative  of  the  scatterers  at  a  single  distance  (range) 
from  the  radar  instrument.  This  “slice”  of  the  CPI  Datacube  is  called  a  “Space-Time  Snapshot”.  Figure  1 
indicates  that,  ultimately,  the  objective  of  the  data  processing  is  the  transformation  of  each  space-time 
snapshot  to  evaluate  some  target  parameter  indicator  and  determine  the  presence  of  a  valid  target  at  a  given 
range,  to  identify  its  azimuth  and  relative  velocity.  For  this  whole  operation  to  be  successful,  the  evaluation 
of  each  space-time  snapshot  must  be  really  representative  of  the  target  presence  alone.  If  the  data  in  the 
space-time  snapshot  contains  interference  components  (clutter  and  /  or  jammer)  the  target  detection  process 
will  be  misled. 


Evidently,  the  processing  applied  to  the  received  signals  must  attenuate  or  eliminate  the  interference  and 
emphasize  valid  target  components  in  each  space-time  snapshot.  This  type  of  filtering  is  in  fact  possible 
due  to  the  different  space-time  characteristics  of  jammer  and  clutter,  with  respect  to  the  target  [1]  [16]. 
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This  filtering  has  been  approached  through  the  implementation  of  a  “space-time  processor”,  which  is  a 
linear  filter  that  combines  all  the  samples  of  the  range  gate  of  interest  to  produce  a  scalar  output  [17]  : 

Z  =  wH  X  (1) 

where  %  is  a  the  space-time  snapshot  vector  (MN-by-1)  for  the  range  gate  of  interest,  and  w  is  an  MN-by-1 
weight  vector. 

In  turn,  an  appropriate  weight  vector  w,  that  will  attenuate  the  inteference  and  emphasize  the  target 
information  is  found  as: 

w  =  Ru'vt  (2) 

where  vt  is  the  space-time  steering  vector  for  the  target  (which  needs  to  be  proposed  repeatedly  to 
investigate  several  possible  azimuth  and  velocities  of  the  target)  and  R„  is  the  interference  covariance 
matrix: 

Here  represents  snapshots  ideally  formed  with  interference  data  only. 

In  the  filtering  approach  of  equation  (eq.  1)  the  removal  of  interference  contributions  to  the  output  indicator 
z  is  effected  through  w,  which,  in  turn  depends  on  the  data  used  to  calculate  Ru.  This  makes  the  space-time 
processor  data-adaptive,  which  enables  it  to  cope  with  the  changing  interference  scenarios  that  the  radar 
instrument  may  face.  This  is  the  basis  of  the  “Space-Time  Adaptive  Processors”  or  STAP  for  radar.  On  the 
other  hand,  the  required  estimation  of  Ru  and  continuous  upgrade  of  the  weight  vector  makes  this  “fully 
adaptive  STAP”  method  very  computationally  demanding,  and  susceptible  to  errors  in  the  estimation  of  Ra. 

The  effect  of  the  estimation  of  the  interference  covariance  matrix  (eq.  3)  and  its  inversion  to  partially 
determine  the  weights  of  the  space-time  processor  (eq.  2)  amount  to  the  identification  of  the  interference 
components  in  the  data  and  their  cancellation.  This  is  often  emphasized  by  the  display  of  the  two- 
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dimensional  Discrete  Fourier  Transform  (DFT)  of  both,  the  original  space-time  snapshot  showing  the 
interference  overpowering  the  target  signal,  and  the  “adapted  STAP  weights”,  which  shows  nulls  in  the 
regions  of  that  Doppler-Azimuth  Spectrum  (ADS)  where  the  interference  lies.  It  should  be  noted  then,  that 
the  STAP  approach  relies  on  the  identification  of  the  interference  components  to  achieve  their  cancellation. 

We  have  proposed  a  complementary  approach  to  the  STAP  method,  in  which  we  seek  to  identify  the 
signature  of  a  valid  moving  target  in  the  Azimuth-Doppler  Spectrum  and  isolate  it  from  the  rest  of  its 
components.  In  order  to  achieve  the  isolation  of  the  target  information  we  propose  that  the  ADS  signature 
of  a  valid  moving  target  is  determined  by  its  relative  velocity  and  angle,  which  defines  its  position  in  an 
abstract  ‘Target  Field”,  t ,  and  by  a  certain  two-dimensional  Impulse  Response,  H,  which  is  determined  by 
the  number  of  elements  and  pulses  per  CPI  of  the  radar  array.  The  details  of  our  deconvolution  approach 
to  target  characterization  are  given  in  the  next  section. 

Methodology 

Figure  2a)  illustrates  the  Azimuth-Doppler  Spectrum  (ADS)  resulting  of  the  simulation  of  a  moving  target 
at  an  azimuth  of  -30°  and  at  a  relative  speed  that  is  indicated  by  a  Doppler  shift  of  one  forth  of  the  Pulse 
Repetition  Frequency,  (PRF/4).  Figure  2b)  proposes  the  existence  of  a  two-dimensional  ‘Target  Field”, 
where  the  presence  of  a  moving  target  is  indicated  by  a  two-dimensional  delta  function  at  the  appropriate 
coordinates  of  azimuth  (x  axis)  and  relative  velocity  (y  axis).  It  should  be  noted  that  if  the  target  were 
simulated  at  different  azimuth  and  relative  speed  the  ADS  would  still  show  the  same  type  of  signature,  with 
a  main  lobe  representing  the  target  Doppler  and  Azimuth  and  sidelobes  extending  in  both  the  ADS 
dimensions.  Similar  to  the  case  in  Figure  2,  that  new  ADS  for  the  target  would  be  matched  by  a  single  two- 
dimensional  impulse  at  the  target  relative  velocity  and  angle  in  the  target  field.  Observations  like  these 
prompted  us  to  propose  that  the  Doppler-Azimuth  Spectrum  expression  of  a  valid  moving  target  can  be 
modeled  as  a  process  of  two-dimensional,  circular  convolution  between  the  abstract  target  field  and  a  basic 
target  “kernel”  or  impulse  response  determined  by  the  M  and  N  parameters  of  the  radar  system.  Figure  3 
illustrates  this  idea  for  a  case  where  N  =  M  =  6. 
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Figure  2.  A  moving  target  in  a)  the  Azimuth- Doppler  Spectrum,  and  b)  the  abstract  “Target  Field". 


Figure  3.  Relationship  between  the  Target  Field,  The  HNM{}  transformation  and  the  space-time 
spectrum  under  the  convolution  model. 
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In  this  process,  it  should  be  noted  that  the  “kernel”  or  two-dimensional  impulse  response  is,  in  fact,  the 
Azimuth-Doppler  Spectrum  corresponding  to  an  elementary  target  at  zero  relative  velocity  and  zero 
azimuth.  It  is  relevant  to  emphasize  that  the  convolution  model  proposed  for  the  generation  of  the  spectrum 
is  being  based  on  the  spectral  expression  of  an  elementary  target. 


If  the  existence  of  a  valid  target  in  the  target  field  is  reflected  in  the  observed  Azimuth-Doppler  Spectrum 
(ADS)  through  two-dimensional,  circular  convolution  with  the  known  impulse  response,  H,  it  may  be 
possible  to  revert  the  process  and  retrieve  corresponding  target  field,  indicating  the  relative  velocity  and 
angle  of  the  target,  through  deconvolution.  Furthermore,  since  the  convolution  model  proposed  is  based  on 
the  kernel  that  projects  an  elementary  target  to  the  Azimuth-Doppler  Spectrum,  the  associated 
deconvolution  process  will  preferentially  retrieve  the  spectral  components  due  to  targets,  minimizing  the 
retrieval  of  other  spectral  components  that  have  a  different  structure,  such  as  clutter. 


To  explore  the  possibilities  of  deconvolution  for  target  characterization,  we  first  express  the  discrete 
convolution  process  mathematically,  as  a  matrix  equation.  In  the  context  of  discrete  target  fields  and 
discrete  Doppler-Azimuth  Spectra,  both  of  them  with  finite  support ,  the  circular  convolution  operation  can 
be  put  in  terms  of  a  matrix  product  of  the  row-scanned  target  field,  pre-multiplied  by  an  appropriate  matrix 
H[i,j]  to  result  in  a  vector  representing  the  row-scanned  Azimuth-Doppler  Spectrum,  or  PSD: 

H<k2)x(K2)  t(K2xl)  =  PSD(K2,,)  (4) 

In  this  equation  K  is  the  assumed  number  of  discrete  values  for  both  dimensions  of  the  Target  Field  and  the 
space-time  spectrum.  The  [i,j]  entries  of  the  matrices  involved  in  the  above  equation  represent  the 
following: 

t[p(K-l)  +q,l]:  the  q*  element  of  the  p*  row  of  the  original  K  by  K  target  field  matrix 

PSD[p(K-l)  +q,l]:  the  q*  element  of  the  p111  row  of  the  original  K  by  K  ADS  matrix 
H[i,j]:  the  contribution  to  the  PSD  in  the  azimuth.Doppler  combination  represented  by 

the  [i,  1  ]  element  of  the  vector  PSD,  due  to  an  assumed  elementary  target  of  unit  power  located  at  the 
azimuth,  Doppler  combination  represented  by  the  [j,  1  ]  element  of  the  vector  t. 
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Expressing  the  convolution  process  in  the  form  of  equation  (4)  makes  it  possible  to  approach  the 
deconvolution  process  with  two  types  of  methods.  Initially,  changing  the  known  terms  of  equation  (4)  to  be 
H  and  PSD,  with  t  as  the  unknown  vector,  transforms  the  equation  into  a  set  of  K2  simultaneous  linear 
equations.  A  direct  solution,  if  one  exists,  is  likely  to  include  elements  of  t  that  are  negative.  This  would  not 
provide  a  valid  solution  for  the  target  field,  since  we  have  proposed  that  its  elements  represent  the  presence 
(t  >0)  or  absence  (t  =  0)  of  a  target  at  the  corresponding  Doppler  and  Azimuth.  Instead,  the  minimization  of 
the  norm  II  Ht  -  PSDII2,  subject  to  the  constraint  that  all  the  elements  of  t  must  be  positive  or  zero  can  be 
sought. 

Alternatively,  equation  (eq.  4)  can  be  used  in  iterative  methods  that  seek  to  minimize  an  error  or  residual 
between  the  observed  PSD  and  an  estimation  obtained  by  convolving  a  proposed  target  field  with  the 
known  kernel.  In  these  methods  the  residual  is  progressively  minimized  by  refining  the  target  field 
estimate  using  information  from  the  residual  obtained  in  the  previous  iteration.  In  these  methods  the 
positivity  of  the  target  field  result  is  promoted  by  applying  a  positivity  constraint  operator  on  the  current 
target  field  estimate  at  each  iteration: 

f  t(ni,  n2)  if  Pl<=  nl  <=  Ql,  P2  <=  n2  <=  Q2 
C[t(n,,  n2)  ]  =  ■!  and  t(n!;  n2)  >=  0  (5) 

1.0  otherwise 


The  mechanisms  for  two-dimensional  deconvolution  that  were  explored  in  these  project  were  as  follows: 

•  The  Non-Negative  Least  Squares  (NNLS)  method: 

The  Non-Negative  Least  Squares  (NNLS)  minimization  algorithm  of  Lawson  and  Hanson  [6]  implements 
the  kind  of  norm  minimization  required  to  solve  equation  (eq.  4)  for  the  target  field,  t. 

x  =  min  UH/-PSDI12  such  that  x  >=  0  (6) 

t 

It  develops  recurrent  solutions  of  increasing  order,  checking  at  each  order  that  the  solution  contains  only 
values  above  a  certain  tolerance  level  (close  to  zero).  If  the  solution  at  any  particular  order  is  not 
completely  non-negative  it  is  regenerated  until  it  satisfies  the  constraint.  The  non-negative  solutions 
continue  to  be  generated  until  they  reach  the  true  order  of  the  problem. 
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The  NNLS  deconvolution  approach  was  the  one  used  during  the  Summer  Faculty  Research  Program  project 
preceding  this  SREP  project.  It  has  been  found  to  result  in  recovered  target  fields  that  are  comprised  by 
delta-like  components,  which  conforms  to  our  assumptions  for  the  target  field.  This  method,  however,  is 
susceptible  to  reproduce  small  and  isolated  target  field  components  as  a  response  to  irregularities  in  the 
Azimuth  -Doppler  Spectrum.  Additionally,  although  its  convergence  in  a  finite  number  of  steps  is 
guaranteed  [6],  the  time  required  by  this  method  is  data-dependent  and  may  be  longer  than  others. 


•  The  Richardson-Lucy  Methods  for  iterative  deconvolution: 

The  fundamental  aspects  of  this  iterative  deconvolution  method  were  developed  independently  by 
Richardson  [10]  and  Lucy  [7]  independently  Their  approaches  have  been  grouped  together  because  they 
are  both  derived  from  Bayes  theorem,  both  treat  the  Target  Field  t(i,  j),  Impulse  Response  H(i,  j),  and 
Doppler- Azimuth  Spectrum  PSD(i,  j)  as  probability  frequency  functions.  The  only  difference  in  their 
application  is  the  method  of  iteration.  Richardson’s  method  is  directly  presented  as  six  nested  loop 
summations  (four  are  shown  below,  the  two  missing  are  needed  for  running  indeces  i  and  j)  and  an  outer 
loop  for  each  iteration,  r  [  10].  The  nested  loops  make  this  approach  extremely  time  consuming. 

»■  ;> ■  »,«■  nit  > pf(m'nWm - j* ‘> 

m=/  n=;  £  S  tr  ( P.q)H (rn  -  p  + 1 ,  n  -  q  +  V)  ™ 

p-a  q-c 


Where: 


it 


PSD(m,n)H (m-i  +  \,n-  j  +  1) 

b  d 

-  p  +  \,n-q  +  \) 


p=a 


a  =  (l,m-K  +  \)^  b  =  (m,  I)mn 
c  =  (l,n-L  +  l)nux  d  =  (n,J)mn 
c  —  i  +  K  —  1  j  —  j  +  L~ I 
/  =  {U}  f={U) 

K,L  are  the  dimensions  of  Hu 
I,J  are  the  dimensions  of  ttj 


(8) 


The  slow  execution  of  this  method  and  the  unimpressive  results  obtained  through  it  in  preliminary  attempts 
to  deconvolve  the  Azimuth-Doppler  Spectrum  prompted  us  not  to  continue  investigating  it. 
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On  the  other  hand,  Lucy’s  method  is  implemented  with  just  one  loop  executing  the  iterations  and  a  series  of 
matrix  multiplications,  where  ®  indicates  circular  convolution,  and  PSD  is  the  original  (“observed”)  PSD, 
while  PSDr  is  the  estimation  of  the  spectrum  at  iteration  r  [7], 


PSD  _  uT 

t  r+\  =  tr  - ®  H 

r  1  PSDr 


PSDr=H®tr 


(10) 


The  circular  convolution  is,  in  practice,  implemented  by  applying  the  current  target  field  and  PSD  vectors  to 
equation  (eq.  4). 


This  method  was  significantly  faster  than  Richardson’s.  On  the  other  hand,  when  applied  to  the 
deconvolution  of  simple  simulated  Doppler- Azimuth  Spectra,  Lucy’s  method  did  not  appear  to  synthesize  a 
target  field  with  clear  indications  for  the  azimuth  and  Doppler  of  the  targets. 


•  Van  Cittert’s  Method  for  Iterative  Deconvolution: 

This  is  a  very  simple  iterative  deconvolution  method  that  strives  to  minimize  the  residual  by  feeding  it  back 
to  upgrade  the  target  field  estimate: 

tr+](iJ)  =  tr(iJ)  +  aRr(i,j)  (11) 

Rr(i,  j )  =  PSD(i,  j)-H  ®  tr(i,  j)  02) 

This  process  iterates,  either  for  a  given  number  of  iterations,  /"max  or  until  the  data  in  the  residual  matrix  is 
considered  negligible!  15]. 

This  method  allows  control  over  the  number  of  iterations  to  apply,  which  determines  the  total  number  of 
calculations  to  execute.  We  have  found  that  100  iterations  is  normally  an  appropriate  number  for  the 
deconvolution  of  Azimuth-Doppler  spectra.  Under  these  conditions  Van  Cittert’s  method  is  one  of  the 
fastest  means  to  execute  the  deconvolution  of  the  ADS.  On  the  other  hand,  this  method  normally  results  in 
target  field  components  that  are  less  delta-like  than  those  from  NNLS  deconvolution.  Figure  4  shows  the 
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target  fields  resulting  from  the  deconvolution  of  the  same  Doppler-Azimuth  Spectrum,  through  the  NNLS, 
Lucy’s  and  Van  Cittert's  deconvolution  methods.  Because  of  their  characteristics  of  accuracy  and  speed, 
the  NNLS  and  Van  Cittert’s  methods  were  chosen  for  further  study. 


a)  Azimuth-Doppler  Spectrum  for  a  b)  Deconvolution  of  a),  through  Lucy’s  Method 

single  target 


c)  Deconvolution  of  a)  through  the 
NNLS  method 


d)  Deconvolution  of  a)  through 
Van  Cittert’s  methrod 


Figure  4.  The  Azimuth-Doppler  Spectrum  of  a  single  elementary  target  without  interference  and  the  results 

of  its  deconvolution  through  three  different  methods. 
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In  investigating  the  performance  of  the  different  deconvolution  methods,  it  was  found  that  the  NNLS 
approach  yielded  the  most  delta-like  target  indicators.  However,  the  processing  time  for  NNLS  tends  to  be 
longer  than  for  other  methods  and  it  is,  in  fact,  data-dependent.  The  dependency  of  the  NNLS  computation 
time  on  the  specific  data  being  processed  is  due  to  the  varying  need  to  reject  solutions  with  negative  or  very 
small  components,  when  they  are  tested  and  do  not  surpass  a  given  tolerance  level,  preset  in  the  NNLS 
algorithm.  Initially,  it  was  hoped  that  relaxing  the  tolerance  parameter  in  the  algorithm  could  accelerate  the 
convergence,  even  if  this  resulted  in  the  appearance  of  small  negative  target  field  components,  which  would 
be  neglected. 

Unfortunately  the  variation  of  the  tolerance  parameter  over  a  wide  range  of  values  did  not  result  in  any 
appreciable  improvement  of  the  speed  for  NNLS.  The  following  table  lists  the  computation  time  for  the 
deconvolution  of  a  3  l-by-3 1  Azimuth- Doppler  Spectrum,  using  a  tolerance  parameter  that  was  modified  by 
the  factors  shown  from  the  default  value  assigned  by  Matlab  ®.  This  value  is  determined  by  Matlab  ®  as 

tolerance  =  max(size(A))  *  norm(A,\)  *eps  (13) 

where  eps  is  a  machine-dependent  accuracy  parameter  that  represents  the  distance  between  1  and  the  next 
largest  floating-point  number.  The  tests  were  run  in  a  200  Mhz  R4400-based  SGI  Challenge  machine,  for 
which  eps  =  2.2204e-016. 


Tolerance  Factor 

Comput  Time  (min) 

Variable  name 

0.0001 

0.90421 

nctf_0001 

0.001 

0.94051 

nctf_001 

0.01 

0.94018 

nctf_01 

0.1 

0.9368 

nctf_l 

1 

0.93762 

nctfl 

10 

0.93963 

nctf_10 

100 

0.9395 

nctf_100 

1000 

0.93834 

nctf_1000 

Table  1 .  NNLS  Computation  time  for  different  Tolerance  Factors,  on  the  same  ADS  for  1  simulated  target. 
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As  observed  in  Table  I,  the  variation  of  the  default  NNLS  tolerance  by  factors  ranging  from  0.001  to  1000 
did  not  cause  any  significant  reduction  in  the  execution  speed,  for  this  particular  problem.  The  recovered 
target  field  was  very  similar  in  all  cases  and  resembled  the  one  NNLS  result  shown  in  Figure  4. 

The  convolution  model  proposed  for  the  transition  from  a  target  field  to  the  corresponding  Azimuth- 
Doppler  Spectrum  was  based  on  the  assumption  of  an  “impulse  response”  or  kernel,  which  specifically 
resulted  in  the  spectral  expression  of  a  valid  target  in  the  ADS.  The  capability  of  the  deconvolution 
method  to  minimize  the  impact  of  other  spectral  components  depends  on  the  difference  of  the  spectral 
structure  of  interference  elements  with  respect  to  the  structure  of  a  valid  target.  The  following  figures 
display  Azimuth-Doppler  Spectra  obtained  from  the  “MountainTop  Toolbox”  created  by  Yaron  Seliktar 
from  The  Georgia  Institute  of  Technology  [11].  This  custom  Matlab  ®  toolbox  provides  a  program  and  a 
front-end  for  the  simulation  of  multi-element  radar  array  signals  that  incorporate  returns  from  a  target, 
clutter  and  jammer,  allowing  the  user  to  specify  the  characteristics  of  the  simulated  components  and  the 
operating  parameters  of  the  radar.  (All  the  simulated  data  for  this  project  were  generated  with  the 
“MountainTop  Toolbox”,  slightly  customized  for  our  purposes). 

Figure  5  represents  the  power  signature  of  an  elementary  target  simulated  at  0  dBs,  at  an  angle  of  20 
degrees  with  respect  to  the  array  broadside  and  with  a  Doppler  frequency  of  -200  Hz.  Figure  6  represents 
the  ADS  for  simulated  clutter  at  40  dBs,  assuming  a  platform  speed  of  100  m/s.  Finally,  Figure  7 
represents  simulated  barrage  jammer  at  OdBs  and  from  an  azimuth  of  30  degrees  with  respect  to  the 
broadside  of  the  array.  It  should  be  noted  that,  in  fact,  the  isolated  target  appears  represented  in  the  ADS  by 
a  pattern  similar  to  the  ones  considered  before,  except  that  reflecting  the  16  pulses  per  CPI  (M=16)  and  the 
14  array  elements  (N  =  14)  assumed  in  the  simulation.  On  the  other  hand,  the  spectral  structure  of  clutter 
displays  the  characteristic  “clutter  ridge”,  which  is,  in  fact,  very  different  from  the  spectral  structure  of  a 
target.  From  this  observation,  we  can  expect  to  apply  the  process  of  deconvolution  to  retrieve  target 
components  preferentially  over  clutter  components. 
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While  the  spectral  structure  of  the  clutter  interference  is,  in  general,  different  from  that  of  a  target,  the 
jammer  interference  appears  in  the  ADS  as  a  high  ridge  at  a  constant  azimuth,  spanning  all  the  Doppler 
frequencies.  Unfortunately,  we  have  found  in  our  preliminary  work  with  ADS  deconvolution  that  the 
jammer  spectrum  is,  in  fact,  retrieved  as  a  series  of  components  in  the  target  field,  all  lined  up  at  the  same 
azimuth.  To  prevent  this  effect  from  dominating  the  recovered  target  field,  we  found  it  necessary  to  apply  a 
pre-processing  scheme  to  remove  the  jammer  components  before  submitting  the  ADS  to  the  deconvolution 
operation.  In  this  way  the  pre-processor  will  attenuate  the  jammer  interference  first  and  then  the 
deconvolution  process  will  preferentially  retrieve  the  true  target  components  of  the  ADS. 

Attenuation  of  the  jammer  interference  only  can  be  accomplished  by  a  process  that  filters  in  the  spatial 
dimension  only.  This  means  that  the  pre-processor  acts  only  on  data  collected  simultaneously  by  all  the 
array  elements.  Since  the  azimuth  of  incidence  of  the  jammer  will  very  likely  change  through  time,  the 
spatial  “notch”  imposed  by  the  desired  pre-processor  for  its  cancellation  must  also  be  defined  in  a  data- 
adaptive  manner.  In  fact,  the  equations  that  govern  the  performance  of  an  adaptive  spatial  filter  for  jammer 
cancellation  will  have  the  same  structure  as  equations  (eq.  1),  (eq.  2)  and  (eq.  3),  except  that  this  time  they 
manipulate  a  vector  with  only  the  samples  collected  by  the  N  array  elements  at  a  single  sampling  time,  the 
spatial  snapshot :  X. 

Here  the  output  indicator  for  the  spatial-only  filter  would  be: 

ZS=WHX  (14) 

In  this  case,  the  N  weights  needed  for  the  spatial  filter  are  also  based  on  the  interference  covariance  matrix, 
Rs  and  a  target  spatial  steering  vector  vs : 

w=Rs'vs  (15) 

Fortunately,  the  interference  covariance  matrix  for  the  space-only  filter  is  estimated  from  the  spatial 

snapshot  only,  of  size  N  [2]  [3]: 

(16) 

In  practice  the  expectation  operator  in  the  above  equation  is  implemented  as  a  time-average: 

«.  -iZx,X,"  (17) 

A  ,=I 
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This  estimate  of  the  spatial  interference  covariance  matrix  would  next  have  to  be  inverted.  This  yields  the 
name  “Sample  Matrix  Inversion”,  or  SMI,  for  this  form  of  adaptive  filtering.  In  practice,  however,  it  has 
been  found  that,  in  order  to  improve  the  condition  number  of  the  estimated  Rj  ,  it  is  advantageous  to  add  a 
constant  term,  fl,  to  all  the  elements  in  its  diagonal.  This  method  is  called  Sample  Matrix  Inversion  “with 
Diagonal  Loading”  and  has  been  shown  to  also  yield  an  antenna  pattern  with  smaller  side  lobes  [2]  [3]. 

In  our  pre-processor,  we  applied  SMI  with  Diagonal  Loading  (P  =  0.01)  to  generate  an  estimate  of  Rs’'. 
Then  this  matrix  was  applied  to  the  data  in  the  CPI  under  study  without  using  a  spatial  steering  vector.  If 
we  isolate  the  effect  of  the  steering  vector  in  the  determination  of  the  output  indicator  zs ,  we  realize  that  an 
equivalent  vector,  X’,  from  which  the  interference  has  been  removed,  without  emphasizing  a  specific 
“look-direction”  can  be  found: 

z,  =  wHx  =  (r;'vs)h  x  =  vsh«r;')h  x)  =  vsh(xhr;')h  =  vsHx,H  us) 

where 

x'=xhr;'  d9) 

is  the  spatially-filtered  spatial  snapshot. 

In  the  experimental  cases  where  a  jamming  component  was  simulated  or  known  to  exist,  we  used  the  above 
procedure  as  a  pre-processing  stage,  substituting  all  the  spatial-snapshots  X  in  the  CPI  under  study  by  the 
corresponding  filtered  X’. 
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Results 


The  ability  of  the  deconvolution  process  to  preferentially  retrieve  target  information  in  the  presence  of 
interference  components  was  demonstrated  by  its  application  to  radar  data  of  several  types.  In  this  report 
some  cases  representative  of  the  following  categories  are  shown: 

a)  Synthetic  data  (generated  through  the  “MountainTop  Toolbox”  [11]),  including  the  simulation  of  target 
and  clutter. 

In  this  case  the  deconvolution  approaches  were  directly  applied  to  the  simulated  data. 

b)  Synthetic  data  including  the  simulation  of  a  target,  clutter  and  jamming  interference. 

In  this  case  the  “SMI  with  diagonal  loading”  approach  was  used  first,  as  a  pre-processing  stage, 
and  then  the  deconvolution  method  was  applied. 

c)  Real  data  collected  through  the  DoD  “Mountaintop  Program”  [  1 3][  1 4],  and  retrieved  from  the 
Common  Research  Environment  for  STAP  Technology  (CREST)  Datalibrary. 

In  all  cases  both  the  NNLS  and  Van  Cittert’s  methods  for  constrained  deconvolution  were  applied,  for 
comparison. 


a)  Deconvolution  of  synthetic  data  with  simulated  clutter  and  target. 

During  the  Summer  Faculty  Research  Program,  the  possibility  of  preferentially  retrieving  the  target  ADS 
spectral  components,  over  the  components  due  to  clutter  was  first  examined  in  simulated  cases  with  ADS  at 
at  resolution,  K  =  31.  (That  is,  the  ADS  was  represented  by  a  matrix  of  size  31  x  31).  This  was  needed 
since  the  interpreted  Matlab  ®  program  implementing  NNLS  deconvolution  was  relatively  slow  and  using 
larger  resolutions  would  have  made  the  processing  time  too  long.  Within  those  constraints,  the  simulations 
were  carried  out  assuming  6  antenna  elements  (N=6)  and  6  pulses  per  CPI  (M  =  6).  The  two-dimensional 
impulse  response  (or  “kernel”)  for  these  simulation  conditions  is  shown  in  Figure  8.  In  that  preliminary 
study  only  the  NNLS  deconvolution  method  was  used,  with  encouraging  results.  Figure  9  shows  a  31  x  31 
synthetic  ADS  were  a  target  was  simulated  with  a  power  of  0  dB,  at  -30°  Azimuth  and  150  Hz  Doppler. 
This  same  ADS  includes  a  simulated  clutter  component  at  30  dB  power,  for  which  the  platform  velocity 
has  been  assumed  at  100  m/s  .  This  clutter  ridge  can  be  seen  distorting  the  spectral  pattern  of  the  target 
along  a  diagonal  “ridge”,  as  it  is  typical  for  this  form  of  interference. 
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Figure  8.  2D-  Impulse  Response  for  a  system  with  N=M=6,  K  =31 


Figure  9.  ADS  for  a  simulated  target( 0  dB,  -3(f ,  150  Hz)  and  clutter  (30  dB,  100  m/s),  K  =  31 


The  results  obtained  by  applying  NNLS  deconvolution  to  this  ADS  are  shown  below,  in  Figure  10  .  The 
retrieved  target  field  is  almost  uniquely  defined  by  a  delta  function  at  the  velocity  (Doppler)  and  angle 
(azimuth)  at  which  the  target  was  simulated,  except  for  some  small  adjacent  peaks.  It  is  believed  that  those 
small  spurious  components  result  from  the  slight  configuration  mismatch  between  the  spectrum  of  the  target 
and  the  kernel  used  for  the  deconvolution,  which  is,  in  turn,  a  result  of  the  limited  resolution  of  the  ADS. 
The  clutter  ridge,  however,  has  not  been  represented  in  the  target  field. 


Figure  10.  Target  Field  recovered  from  the  ADS  in  Figure  9,  through  NNLS 
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The  results  of  applying  Van  Cittert’s  iterative  constrained  deconvolution  over  100  iterations,  with  a  a  rate 
factor  a  =  lxlO'6,  to  the  same  ADS  are  shown  in  Figure  11.  These  results  indicate  that  the  residual 
minimization  of  Van  Cittert’s  method  yields  a  target  indication  that  is  not  a  single  delta  function.  Instead, 
the  target  field  converges  to  a  target  representation  that  spans  a  few  cells  in  the  target  field,  around  the 
target  characteristics  (azimuth,  Doppler).  This  feature  of  the  Van  Cittert’s  method  was  verified  through 
subsequent  applications  to  other  data.  Although,  in  principle,  this  type  of  result  is  not  as  specific  as  the  one 
potentially  offered  by  NNLS,  Van  Cittert’s  method  was  found  to  be  generally  faster  than  NNLS 
deconvolution.  For  the  particular  example  considered  above,  Van  Cittert’s  deconvolution  took  less  than  one 
tenth  of  the  time  required  by  NNLS  deconvolution.  In  presenting  this  timing  comparison,  it  must  be  kept  in 
mind  that  Van  Cittert’s  timing  is  completely  defined  by  the  size  of  the  ADS  matrix,  K,  and  the  number  of 
iterations  allowed.  On  the  other  hand  NNLS  timing  is  influenced  by  the  size  of  the  ADS  matrix,  but  then  it 
depends  significantly  on  the  nature  of  the  data  itself,  which  determines  how  many  times  the  inner  loop  in 
the  algorithm  needs  to  be  repeated  for  each  progressively  bigger  solution  order. 


Figure  1 1.  Target  Field  recovered  from  the  ADS  in  Figure  9,  through  Van  Cittert’s  method 

When  the  power  level  of  the  synthetic  clutter  was  raised  to  40  dB,  some  of  the  concentrated  clutter  patches 
reached  levels  that  made  them  resemble  the  target  power  configuration.  This  caused  the  retrieval  of  spurious 
peaks  in  the  recovered  target  fields,  in  addition  to  the  ones  associated  with  the  true  target.  From  the  results 
shown  in  the  figures  below,  it  can  be  realized  that  NNLS  deconvolution  is  more  susceptible  to  these  effects 
than  Van  Citterf  s.  The  latter,  in  fact,  only  reflected  a  significant  spurious  peak  at  the  comer  of  the  ADS 
where  the  clutter  ndge  ends.  The  structure  of  the  ADS  in  this  area  shows,  in  effect,  a  partial  main  lobe  with 
smaller  lobes  around  it.  which  are  perceived  by  the  deconvolution  methods  as  its  side  lobes. 
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Once  both  deconvolution  methods  were  implemented  in  (compiled)  ANSI  C,  it  was  possible  to  experiment 
with  better  ASD  resolutions,  at  a  reasonable  execution  time.  This  step  was  necessary  in  order  to  perform 
deconvolution  on  Azimuth-Doppler  spectra  simulated  with  more  realistic  number  of  elements  ,  N,  and 
pulses  per  CPI,  M.  Our  next  level  of  experimentation  was  with  ADS  matrices  of  size  48  x  48  and  N  and  M 
parameters  which  resemble  those  for  the  radar  system  used  in  the  Mountaintop  data,  (N=14,  M  =  16).  The 
figure  below  shows  the  48  x  48  ADS  resulting  from  simulating  the  same  scenario  as  before  (one  target  at 
OdB,  -30°  Azimuth,  150  Hz  Doppler,  clutter  at  30  dB  power,  platform  velocity  100  m/s)  but  assuming  N=14 
and  M=16. 


Figure  15.  ADS  for  a  simulated  target(0  dB,  -3<f  ,  150  Hz)  and  clutter  (30  dB,  100  m/s),  K  =  48 


The  target  field  obtained  through  Van  Cittert’s  deconvolution  is  shown  below:  The  target  information  is 
recovered  and  the  clutter  components  are  virtually  disregarded. 


Figure  16.  Target  Field  recovered  from  the  ADS  in  Figure  15,  through  Van  Cittert’s  Deconvolution 
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The  results  obtained  through  NNLS  deconvolution  are  shown  in  the  next  figure.  In  addition  to  the  target 
field  components  associated  with  the  target  .  this  result  includes  groups  of  smaller  components  at  the  ends 
of  the  clutter  ridge  and  at  the  intersections  of  it  with  the  side  lobes  of  the  target  spectral  expression.  The 
power  contributions  of  both  target  (side  lobes)  and  clutter  ridge  are  superimposed  in  those  intersections, 
which  makes  that  region  of  the  ADS  resemble  a  target  main  lobe.  This  effect  is  reinforced  by  the  fact  that 
the  traget  side  lobes  and  clutter  ridge  approximate  the  structure  that  the  side  lobes  of  a  mainlobe  in  the 
intersection  would  have. 


Normalized  Doppfer  (Hz)  **  '  ^  sriaznuth) 


Figure  17.  Target  Field  recovered  from  the  ADS  in  Figure  15,  through  NNLS 


b)  Deconvolution  of  synthetic  data  with  simulated  target,  clutter  and  jammer 

To  verify  the  efficiency  of  the  proposed  pre-processor  for  jamming  attenuation,  a  data  set  was  simulated 
containing  a  target  at  0  dB.  -30°  azimuth,  150  Hz  Doppler,  along  with  clutter  at  0  dB  (platform  velocity 
100  m/s)  and  barrage  jammer  at  30  dB  ,  from  an  azimuth  of  30°.  In  this  first  case  the  attenuation  of  the 
jammer  with  low-power  clutter  only  is  studied. 

The  following  two  figures  show  the  original  simulated  ADS  and  the  ADS  of  the  data  after  SMI  spatial 
filtering,  with  covariance  matrix  diagonal  loading.  It  should  be  noted  that  the  jammer  component,  which 
overwhelms  the  original  ADS  has  been  effectively  “notched  out  after  SMI  spatial  filtering.  In  this 
intermediate  result  the  power  representations  of  the  target  and  the  clutter  are  now  detectable. 
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Figure  21.  Target  Field  recovered  from  the  ADS  in  Figure  19,  through  Van  Cittert’s  Deconvolution 


Both  approaches  are  capable  of  retrieving  the  target  component,  NNLS  generates  some  small  peaks  around 
the  actual  target  characteristics  (velocity  and  azimuth).  Van  Cittert’s  results  retrieve  only  the  information  of 
the  target,  disregarding  the  clutter  components. 

The  complete  process  involving  the  SMI  pre-processor  and  the  deconvolution  approaches  were  tested  also 
with  a  scenario  similar  to  the  previous  one  in  which  the  power  of  the  clutter  has  been  increased  to  30  dB. 
The  original  ADS,  the  intermediate  result  after  the  pre-processor  and  the  final  results  obtained  from  Van 
Cittert’s  and  NNLS  are  shown  in  the  following  figures. 


Figure  22..  ADS  for  simulated  target(0  dB,  -3<T,  150  Hz),  clutter  (30  dB,  100  m/s)  &  jammer  (30  dB,  3(f) 
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Figure  25.  Target  Field  recovered  from  the  ADS  in  Figure  23,  through  Van  Cittert's  Deconvolution 

In  this  case  Van  Cittert’s  results  indicate  the  characteristics  of  the  target  but  also  display  a  component  that 
reflects  the  end  of  the  clutter  ridge,  as  explained  before.  On  the  other  hand,  the  distortion  introduced  in  the 
clutter  components  by  the  SMI  spatial  filtering  affects  NNLS  more  severely,  causing  it  to  represent  small 
elements  in  the  intersections  of  the  target  side  lobes  and  the  clutter  ridges. 
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c)  Deconvolution  of  real  data  collected  through  the  DoD  “Mountaintop  Program”,  and  retrieved  from  .the 
Common  Research  Environment  for  STAP  Technology  (CREST)  Datalibrary. 

A  number  of  files  from  the  CREST  Datalibrary  were  processed  for  target  isolation.  These  files  were 
collected  with  the  Radar  Surveillance  Technology  Experimental  Radar  (RSTER)  system,  employing  14 
receiving  elements  (N=14)  and  with  16  pulses  per  CPI  (M=16).  The  example  presented  here  is  the  data  in 
file  “stap  1004vl.l030,  cpi  3724,  range  bin  20”.  In  this  data  set  the  Inverse  Displaced  Phase  Center  Array 
(IDPCA)  technique  was  used  to  emulate  the  movement  of  an  airborne  platform  with  a  stationary  radar 
system,  spreading  the  spectral  manifestation  of  the  clutter  in  the  Doppler  dimension  [8].  In  this  real  data 
set,  however,  the  intensity  of  the  clutter  ridge  is  not  uniform,  with  patches  at  about  -20  and  60  azimuth 
being  stronger  scatterers.  A  moving  target  simulator  (MTS)  had  been  set  up  to  emulate  a  target  at  about  -5  . 
and  about  100  Hz  Doppler,  very  close  to  one  of  the  strong  background  scatterers.  In  fact  the  figure  below 
shows  how  the  clutter  patch  and  the  MTS  are  almost  integrated  in  the  original  Azimuth-Doppler  spectrum. 


Snapshot  Power  Spectrum:  (cpi  1,  bin  20) 


sin(azimuth) 

Figure  26.  Azimuth-Doppler  Spectrum  for  CREST  file:  STAP 1004vl.  1030.  CPI3721 ,  range  bin  20 


The  following  figures  show  the  results  of  NNLS  and  Van  Cittert’s  deconvolution  of  the  ADS  shown  above. 
Van  Cittert’s  deconvolution  retrieves  the  target,  showing  a  single  smaller  peak  derived  from  the  strong 
clutter  patch,  but  in  this  target  field  both  nonzero  elements  are  clearly  separated,  and  the  target  inication  is 
predominant.  NNLS  generated  a  target  indication  but  it  also  developed  a  number  of  smaller  components, 
including  one  for  the  second  patch  of  strong  clutter  at  about  60°  Azimuth. 
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Discussion 


The  previous  section  showed  the  application  of  the  deconvolution  process  to  progressively  more  realistic 
scenarios  for  the  Azimuth-Doppler  Spectrum  of  simulated  and  real  array  radar  signals.  In  all  cases  the 
results  obtained  from  two  different  forms  of  implementation,  Van  Cittert’s  and  NNLS  were  shown  and 
compared. 


In  the  simplest  case,  in  which  idealized  signal  elements  (target  and  clutter)  were  simulated  at  a  low  spectral 
resolution  (K  =  3 1 ),  for  a  system  with  only  6  array  elements  and  6  pulses  transmitted  per  CPI  (N  =  M  =  6) , 
both  methods  were  able  to  successfully  retrieve  the  information  of  the  target,  disregarding  the  spectral 
components  related  to  the  clutter  interference.  In  these  cases  the  NNLS  method  of  deconvolution  seemed  to 
provide  a  more  specific  characterization  of  the  target,  with  respect  to  its  Doppler  and  azimuth,  since  the 
recovered  target  fields  would  only  involve  one  or  two  non-zero  values,  in  each  case.  Van  Cittert’s  method 
of  iterative  constrained  deconvolution,  on  the  other  hand,  resulted  in  target  fields  were  a  thin  two- 
dimensional  pulse  would  be  observed,  representing  the  target.  Although  this  pulse  was  composed  of  more 
than  two-nonzero  target  field  samples,  its  shape  was  still  indicative  of  the  proposed  azimuth-Doppler 
characterization  of  the  target,  particularly  in  the  contour  map  of  the  recovered  target  field. 


While  maintaining  the  rest  of  the  simulation  parameters  unchanged  (K  =  31,  M  =  N  =  6),  the  degradation  of 
the  recovered  target  field  in  the  presence  of  progressively  larger  levels  of  clutter  power  was  observed.  The 
emergence  of  smaller  target  field  components  at  locations  other  than  that  of  the  true  target  was  realized. 
Studying  these  artifactual  target  field  components,  they  were  found  to  occur  in  association  with  areas  of 
the  Azimuth-Doppler  Spectrum  that  presented  a  structure  that  resembled  that  of  a  valid  target.  Typically, 
these  areas  occurred  at  points  in  the  Azimuth-Doppler  Spectrum  where  the  clutter  ridge  intersects  other 
spectral  elements,  such  as  the  side  lobes  of  the  target  spectral  structure,  or  at  the  extremes  of  the  ridge, 
particularly  if  they  were  at  a  comer  of  the  Azimuth-Doppler  Spectrum.  In  the  case  of  target  side  lobe 
intersections  with  the  clutter  ridge,  the  local  superposition  of  power  generates  what  resembles  a  target  main 
lobe,  and  the  elements  intersecting  at  that  Azimuth-Doppler  combination  appear  to  play  the  role  of  the  side 
lobes  for  the  artifactual  "main  lobe”.  In  the  case  of  the  extremes  of  the  clutter  ridge,  if  there  is  a  significant 
accumulation  of  power  at  the  comer  of  the  Azimuth-Doppler  Spectrum  the  process  of  deconvolution  seems 
to  resolve  the  missing  information  for  those  power  structures  (i.e.,  the  missing  %  of  the  surrounding 
spectrum)  in  favor  of  identifying  them  as  matches  with  the  kernel  used  for  the  deconvolution  (the  spectrum 
of  an  elementary  target). 

It  should  be  stressed  that  the  understanding  gained  of  the  possible  generation  of  secondary  components  in 
the  target  field  retrieved  through  deconvolution  can  be  used  to  develop  post-processing  mechanisms  to 
sort-out  those  components  that  are  likely  to  be  artifactual.  For  example,  if  a  larger  target  field  component  is 
identified,  smaller  components  along  that  same  azimuth  and  Doppler  values  could  be  suspected  as  being 
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artifactual.  The  highly  simplified  structure  of  the  target  field,  with  respect  to  the  original  Azimuth-Doppler 
Spectrum,  would  allow  for  validation  mechanisms  of  this  type  to  be  executed  rapidly. 

The  next  stage  of  testing  involved  working  with  Azimuth-Doppler  spectra  at  a  higher  resolution  (K  =  48). 
This  represented  a  significant  increase  of  the  processing  time,  since  it  involved  a  size  increase  of  the 
convolution  matrix  H  from  dimensions  3  l2-by-3 12  to  dimensions  482-by-482.  However,  this  was  necessary 
so  that  experimentation  with  realistic  N=14  and  M=16  parameters  could  be  performed.  Since  the  number  of 
lobes  in  the  impulse  response  or  kernel  of  the  transformation  in  each  dimension  of  the  Azimuth-Doppler 
Spectrum  is  given  by  N  and  M,  the  higher  number  of  lobes  can  only  be  adequately  represented  if  the 
resolution  of  the  spectrum  is  also  higher.  In  fact,  even  at  a  resolution  K  =  48,  the  main  lobe  of  the  kernel 
generated  assuming  N  =  14,  M  =  16  is  represented  by  a  smaller  number  of  Azimuth-Doppler  spectral  values 
than  for  N=M=6,  at  K=31.  This  reduction  in  effective  resolution  will  make  the  process  of  deconvolution 
more  susceptible  to  mismatch  between  the  spectral  expression  of  a  target  and  the  kernel  itself.  This 
mismatch  seems  to  result  in  a  target  indication  in  the  target  field  that  appear  to  be  more  “fragmented”  , 
comprising  a  number  of  peaks  clustered  together  (as  oppose  to  a  single  peak).  This  effect,  however, 
impacted  only  the  NNLS  implementation  of  deconvolution,  which  also  was  observed  to  be  more  susceptible 
to  the  generation  of  secondary  peaks  at  the  intersections  of  target  side  lobes  and  the  clutter  ridge,  as 
explained  above.  Van  Cittert’s  method,  on  the  other  hand,  seemed  to  be  much  less  affected  by  the  possible 
mismatch  between  the  actual  target  and  the  kernel.  This  method,  in  fact,  continued  to  produce  thin  pulses 
indicating  the  target  characteristics.  Interestingly,  the  number  of  non-zero  target  field  values  clustered  in  the 
resulting  thin  pulse  seems  to  be  about  the  same  as  for  the  lower  resolution  (K  =  31)  examples.  In  that  sense, 
then,  the  indications  obtained  from  Van  Cittert’s  deconvolution  were  more  specific  at  this  higher  resolution 
(K  =  48). 

The  effectiveness  of  a  Sample  Matrix  Inversion  (SMI)  adaptive  spatial  filter  for  removing  jammer 
interference  prior  to  deconvolution  was  tested  at  the  resolution  of  K  =  48,  for  two  different  levels  of  clutter 
power,  0  dB  and  30  dB.  In  preliminary  studies  during  the  preceding  Summer  Faculty  Research  Program 
project,  we  had  detected  that  the  spectral  structure  of  barrage  jammer  was  recovered  by  the  deconvolution 
process  as  a  line  of  target  indicators  along  the  same  jammer  azimuth  value,  causing  the  process  not  to 
retrieve  valid  target  information.  We  fulfilled  one  of  the  proposed  goals  of  this  SREP  project  by 
incorporating  the  SMI  adaptive  spatial  filter,  with  diagonal  loading,  as  a  pre-processor  for  the 
deconvolution  approach.  The  operation  of  this  pre-processor  is  not  computationally  expensive.  This 
operation  was  outlined  in  the  section  on  methodology  of  this  report. 

The  results  obtained  from  a  simulated  spectrum  in  which  a  significant  jammer  component  had  been  added 
to  the  clutter  and  target  components  were  positive.  No  artifactual  target  field  peaks  were  developed  at  the 
angle  of  the  jammer  when  the  data  coming  out  of  the  SMI  pre-processor  was  deconvolved.  For  the  case 
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including  low-power  clutter  (0  dB)  the  only  elements  retrieved  in  the  target  field  were  those  associated  with 
the  actual  target.  When  a  stronger  clutter  component  was  simulated  (30  dB)  the  results  were  similar  to  the 
case  for  those  levels  of  power  in  the  target  and  the  clutter,  without  a  jammer  component.  It  was  observed, 
however,  that  the  SMI  pre-processing  seemed  to  accentuate  the  mismatch  between  target  spectrum  and 
kernel,  as  slightly  more  secondary  peaks  were  generated  by  NNLS  deconvolution  in  the  target  field.  Van 
Cittert’s  deconvolution  was  also  affected  by  SMI  pre-processing  for  the  stronger  clutter  case,  generating 
one  spurious  component,  at  the  comer  of  the  target  field. 

The  last  set  of  tests  carried  out  were  performed  with  a  resolution  of  K  =  48,  on  data  retrieved  from  the 
Common  Research  Environment  for  STAP  Technology  (CREST)  Datalibrary.  The  files  in  the  datalibrary 
were  recorded  under  the  DoD  MountatinTop  Program.  Most  of  the  files  of  interest  were  collected  with  the 
Radar  Surveillance  Technology  Experimental  Radar  (RSTER)  system,  using  14  elements  on  receive.  The 
RSTER  system  is  really  mounted  on  a  stationary  platform.  So,  to  emulate  the  clutter  dispersion  in  the 
Doppler  domain  that  is  characteristic  of  moving  platforms,  the  concept  of  Inverse  Displaced  Phase  Center 
Array  (IDPCA)  is  used.  The  IDPCA  system  transmits  subsequent  pulses  from  different  columns,  in  rapid 
succession,  thus  emulating  the  transmission  of  pulses  from  a  single  source  in  a  platform  that  is  effectively 
moving  [8].  The  specific  file  for  which  deconvolution  results  are  shown,  “stap  1004vl.l030,  cpi  3724, 
range  bin  20”.  Was  recorded  with  the  RSTER  and  the  IDPCA  systems  mounted  at  North  Oscura  Peak,  in 
the  White  Sands  Missile  Range.  In  this  particular  case  there  is  a  mountainous  are,  called  “Diamond  Peak”  , 
at  a  distance  of  about  154  Km  from  the  RSTER,  spanning  an  azimuth  interval  around  -20°  (with  respect  to 
the  broadside  of  the  RSTER  at  the  time  this  record  was  obtained).  For  this  test,  a  moving  target  simulator 
(MTS)  was  set  up  to  introduce  a  target  signal  very  close  to  the  strong  clutter  patch,  at  slightly  less  negative 
azimuth  and  Doppler.  In  these  more  realistic  conditions  the  Van  Cittert’s  method  of  deconvolution  seemed 
to  perform  best,  generating  a  correct  target  indication  and  minimizing  the  strung  clutter  scatterer  to  a 
smaller  target  field  peak  that ,  in  addition,  appears  now  clearly  separated  from  the  target  indication.  In  this 
case  the  NNLS  method  of  deconvolution  was  also  capable  to  generate  its  largest  component  at  the  correct 
target  Azimuth  and  Doppler.  However,  NNLS  generated  more  secondary  peaks,  including  one  that 
represented  a  second  strong  clutter  patch  at  positive  Doppler  and  azimuth. 
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Conclusions 


Through  this  Summer  Research  Extension  Program  project  we  have  been  able  to  further  explore  the 
deconvolution  of  the  Azimuth-Doppler  Spectrum  (ADS)  of  signals  from  an  airborne  multi-element  array 
system,  for  moving  target  detection.  One  of  the  aspects  that  makes  this  approach  of  particular  interest  is 
that  it  is  based  on  retrieving  information  about  the  target  from  the  ADS,  as  opposed  to  other  techniques, 
such  as  Space-Time  Adaptive  Processing  (STAP),  which  concentrate  in  the  identification  of  interference 
components  (clutter  and  jammer)  for  their  cancellation.  As  such,  the  deconvolution  approach  can  be  used  in 
conjunction  with  STAP,  to  complement  it  or,  possibly,  to  guide  the  selection  of  Azimuth-Doppler  cells  that 
STAP  scrutinizes,  by  assigning  corresponding  space-time  steering  vectors  for  evaluation. 

This  study  confirmed  the  feasibility  of  using  a  simple  pre-processor,  such  as  the  Sample  Matrix  Inversion 
spatial  filter  with  diagonal  loading,  to  remove  jamming  components  in  the  ADS  before  attempting  its 
deconvolution.  Our  preliminary  study  (SFRP)  had  revealed  that,  while  deconvolution  is  capable  of  rejecting 
the  clutter  interference,  it  is  not  robust  against  significant  jammer  components. 

In  dealing  with  Azimuth-Doppler  spectra  containing  target  and  clutter  components,  the  deconvolution 
concept  was  shown  to  have  the  capacity  to  retrieve  the  target  information.  The  clutter  interference  was 
canceled  or  attenuated  significantly  for  cases  where  the  power  of  the  background  returns  was  moderately 
higher  than  the  target  to  be  retrieved  (30  to  40  dB  stronger). 

In  this  project  we  explored  the  deconvolution  of  data  at  progressively  higher  levels  of  realism.  Both 
deconvolution  approaches  tested,  NNLS  and  Van  Cittert’s  method,  were  able  to  isolate  the  target 
information  in  simulated  cases  where  the  relative  spectral  resolution  allowed  a  good  match  between  the 
proposed  “impulse  response”  or  “kemer  and  the  ADS  expression  of  the  target  (N=M=6,  K=31).  When 
more  standard  values  for  the  number  of  antenna  elements  and  pulses  per  CPI  were  used  (N=14,  M=16),  an 
increase  in  the  ADS  resolution  was  needed  (K=48),  at  the  expense  of  the  associated  increase  in  computation 
time.  In  these  cases  the  potential  mismatch  between  the  kernel  and  the  target  expression  in  the  ADS  resulted 
in  the  appearance  of  some  secondary  components  in  the  recovered  target  field.  This  was  observed  mainly  in 
the  intersection  of  spectral  structures  due  to  the  target  and  the  clutter,  or  in  the  corner  of  the  ADS.  The 
insight  gained  about  these  secondary  peaks  leads  us  to  the  conclusion  that  they  can  be  discarded  by  a 
relatively  simple  post-processor. 

Both  forms  of  deconvolution  implemented  in  this  project  were  able  to  retrieve  information  about  the  target 
in  the  test  with  the  CREST  data  shown.  The  Van  Cittert’s  method  seemed  to  be  less  affected  by  possible 
kernel-target  mismatch  and  also  performed  better  in  the  CREST  data  test.  This  method  has  an  added 
advantage  in  that  it  can  be  set  to  perform  a  pre-determined  number  of  iterations,  each  one  of  them  involving 
a  known  number  of  operations,  which  makes  its  timing  predictable  and  dependent  only  on  the  ADS 
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resolution.  The  Non-Negative  Least  Squares  (NNLS)  method,  on  the  other  hand,  has  a  computation  time 
that  is  data-dependent  and  varies,  even  for  a  fixed  ADS  resolution  level. 

Having  available  the  C  implementation  of  both  deconvolution  methods,  as  a  result  from  this  project,  our 
group  is  currently  contemplating  the  parallelization  of  both  of  them,  which  could  bring  their  execution  times 
closer  to  the  real-time  requirements  determined  by  the  application.  This  will  also  enable  us  to  study  a  larger 
number  of  cases  and  variations  of  the  methods,  possibly  at  higher  levels  of  ADS  resolution. 
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APPENDIX:  C-implementation  for  Van  Cittert’s  Deconvolution 

(NOTE:  The  length  of  the  C-code  for  NNLS  deconvolution  makes  it  impractical  to  include  it  in  this 

appendix.  The  code,  however,  is  available  from  the  principal  investigator  at:  barreto@eng.fiu.edu 

/**********yan_Qttert’s_Deconvolution_of_a_48-by-48  Azimuth-Doppler  Spectrum******  I 

# include  <stdio.h> 

# include  <math.h> 

# include  <stddef.h> 

# include  <time.h> 

#include  <cblas.h> 


♦define  MAXSTRING  55 
♦define  MAX  2304 
♦define  SQ_MAX  48 
♦define  MAX_SQ  5308416 
♦define  MAX_ITER  100 


/******************  Global 
double  values [MAX] ; 

/* 

/* 

/* 

/* 


Variable  Definitions 


*/ 
*/ 
*/ 
*/ 


This  array  will  hold  the 
column  vector  values  read 
in  from  the  text  files  created 
in  Mat lab  (target  and  PSD) 


double  bvalues [MAX_SQ] ; 


/*  This  array  will  hold  the  big 
/*  column  vector  values  read  in  from 
/*  the  text  files  created  in  Matlab 


*/ 

*/ 

*/ 


double  bmatrix[MAX] [MAX] ;  /*  This  array  will  hold  the 

/*  square  matrix  values 
/*  reshaped 


*/ 

*/ 

*/ 


int  length [2];  /*  lengths  of  column  vectors  for:  */ 

. j-..-..  ^  o  =  power  spectral  density  (y) ,  */ 

/*  1  =  point  spread  function  (H)  */ 


void  main (int  argc,  char  **argv) 

I  ************************ *  Variable  Definitions  * 
int  m,  n,  o,  i(  j,  k,  iteration,  count,  N, 


★★**★★*★★*★********* 
M,  incx,  incy; 


*/ 


double  target_0 [MAX] ,  target_l [MAX]  ,  target_m [MAX] ,  residue [MAX]  , 
cir_conv [MAX] ,  radius; 

double  alpha,  local_temp,  temp,  acc,  total_time,  dN,  cl,  c2; 
time_t  timer, tl,t2; 


FILE  *outdata,  * indata [2 ]; 

/*********  check  if  correct  number  of  arguments  were  passed  ******  / 

if  (argc  <  5) 

{  fprintf  (stderr ,  "Usage : \t%s\tPSD  file\tPSF  (H) 

f ile\tRecovered  Target  alpha\n" ,  argv  [0]); 
exit  (0) ; 

} 

alpha  =  atof (argv [4] ) ; 
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/**********  Read  values  contained  in  1st  two  argument  files  *********  */ 
for  (n=0;  n<2;  n++) 

{ 

indata[n]  =  fopen(argv[n+l] ,  "rb"); 

length [n]  =  0; 
count  =  length [ n]; 

if  (n==0 ) 

while  ( fscanf ( indata [n] ,  "%lf\n",  & values [count] )  ==  1) 

{  length [n]++; 

count  =  length [n] ;  } 

else 

while  (fscanf (indata [n] ,  "%lf\n",  &bvalues [count] )  ==  1) 

{  length [n]++; 

count  =  length [n] ;  } 

dN  =  sqrt (length [n] ) ; 

N  =  ( int ) dN ; 

printf ( " \nNumber  of  values  in  %s  =  %d\n" ,  argv[n+l] , 
length [n] ) ; 

printf  ("Square  Root  of  %d  =  %d\n" ,  length [n] ,  N)  ; 

} 

/*****  Reshape  column  vector  into  a  square  matrix  for  dot  product******/ 
for  ( i=0 ;  i<N;  i  +  +) 

for  ( j  =  0 ;  j  <N;  j++) 

{  k  =  (N*i)  +  j; 

bmatrix [ i ] [ j ]  =  bvalues[k];  } 

/*********  Initiallize  target  matrix  to  PSD  read  in  from  file*********/ 
for  (i=0;  i<length[0] ;  i  +  +) 

target_0[i]  =  values [i]  ; 

n  =  length [0] ; 
incx  =  1 ; 
incy  =  1 ; 

/*****  Start  time  for  Van  Cittert's  Algorithm  *****/ 
tl  =  time (&timer ) ; 

for  ( iteration=0 ;  iteration  <  MAX_ITER;  iteration++) 

{  for  (i=0;  i<length[0];  i++) 

{  cir_conv[i]  =  ddot(n,  bmatrix[i],  incx,  target_0,  incy); 
residue[i]  =  values[i]  -  cir_conv[i] ; 

} 

for  (i=0;  i<length[0];  i++) 

{  target_l[i]  =  target_0[i]  +  (alpha  *  residue[i]); 
target_0[i]  =  (target_l[i]  +  fabs (target_l [i] ) )  /  2; 

} 

} 

/**★**  stop  time  for  Van  Cittert's  Algorithm  *****/ 
t2  =  time (&timer ) ; 

/*************  pass  calculated  data  to  final  argument  **************  */ 
outdata  =  fopen (argv [3 ] ,  "wb" ) ; 
for  ( i=0 ;  i<length[0];  i++) 

fprintf (outdata,  "%f\n",  target_0 [ i ] ) ; 

printf ( "elapsed  time  %d  seconds\n" , t2-tl) ; 


4-36 


t 


Modeling  and  Design  of  New  cold  Cathode  Emitters 
Using  Wide  Bandgap  Semiconductors 


M.  Cahay 

Associate  Professor 
Department  of  Electrical  Engineering 


University  of  Cincinnati 
Cincinnati,  OH  4522 1 


Final  Report  for: 

Summer  Research  Extension  Program 
Wright  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base,  Washington,  D  C. 

and 

University  of  Cincinnati 


December  1997 


5-1 


Modeling  and  Design  of  New  Cold  Cathode  Emitters 
Using  Wide  Bandgap  Semiconductors 


M.Cahay 

Associate  Professor 
Department  of  Electrical  Engineeering 
University  of  Cincinnati 


ABSTRACT 


We  analyze  the  importance  of  current  crowding  in  a  new  cold  cathode  emitter  which  consists 
of  a  thin  wide  bandgap  semiconductor  material  sandwiched  between  a  metallic  or  heavily 
doped  semiconductor  and  a  low  work  function  semimetallic  thin  film.  Potential  material 
candidates  are  suggested  to  achieve  low-voltage  (<  10  V),  room-temperature  cold  cathode 
operation  with  emission  currents  of  several  tens  of  A/  cm2.  We  calculate  the  lateral  potential 
drop  which  occurs  across  the  emission  window  of  cold  cathodes  with  circular  geometry  and 
describe  its  effects  on  the  emitted  current  density  profile.  The  power  dissipation  in  the 
cold  cathode  is  calculated  as  a  function  of  a  dimensionless  parameter  characterizing  the 
importance  of  current  crowding.  We  determine  the  range  of  dc  bias  over  which  cold  cathodes 
of  different  radii  must  be  operated  to  minimize  current  crowding  and  self-heating  effects. 
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I.  INTRODUCTION 


Recently,  there  has  been  renewed  interest  into  cold  cathode  emitters  for  applications  to 
a  variety  of  electronic  devices,  including  microwave  vacuum  transistors  and  tubes,  pressure 
sensors,  thin  panel  displays,  high  temperature  and  radiation  tolerant  sensors,  among  others 
[1,  2],  Introduction  of  such  emitters  would  permit  an  unprecedented  compactness  and  weight 
reduction  in  device  and  equipment  design.  Low  temperature  operation  in  nonthermionic 
electron  emitters  is  very  desirable  for  keeping  the  statistical  energy  distribution  of  emitted 
electrons  as  narrow  as  possible,  to  minimize  thermal  drift  of  solid  state  device  characteristics, 
and  to  avoid  accelerated  thermal  aging  or  destruction  by  internal  mechanical  stress  and 
fatigue.  To  keep  the  emitter  temperature  rise  small  appears  easy  if  the  emitters  are  built  as 
thin  epitaxial  films  using  vertical  layering  technology  due  to  the  extremely  short  heatpaths 
and  excellent  heatsinking  possibilities  offered  with  this  architecture.  For  an  electron  emitter 
to  be  useful  in  microwave  tube  applications  it  should  be  capable  of  delivering  current  densities 
in  excess  of  10  A/  cm 2  and  to  sustain  emission  during  operational  lifetimes  over  periods  of 
10s  hrs.  To  satisfy  this  requirement,  the  structural  and  chemical  composition  must  be  stable. 
This  rules  out  the  historically  practiced  use  of  alkali  metal  films  on  emitter  surfaces  for  the 
lowering  of  electronic  work  functions.  These  films  sublimate,  evaporate  or  surface  migrate 
over  time  and  end  up  on  various  surfaces  inside  the  vacuum  envelop. 

Several  cold  cathode  emitters  have  been  proposed  since  their  first  successful  demon¬ 
stration  by  Williams  and  Simon  [3]  using  a  cesiated  p-type  GaP  structure.  A  review  and 
criticism  of  the  different  cold  cathode  approaches  was  given  recently  by  Akinwande  et  al. 
[4].  In  this  work,  we  propose  a  new  cold  cathode  emitter  concept  and  use  a  simple  model  to 
show  that  the  new  emitter  is  capable  of  achieving  low  voltage  (<  10  V)  room  temperature 
operation  with  emission  current  approaching  100  A/cm2  and  large  efficiencies.  A  prelimi¬ 
nary  report  of  this  work  has  been  published  earlier  [5].  The  architecture  of  the  structure  is 
shown  in  Fig.  1.  The  main  elements  in  the  design  and  functioning  of  such  an  emitter  are 
:  (1)  a  wide  bandgap  semiconductor  slab  equipped  on  one  side  with  a  metallic  contact  [6] 
or  a  heavily  doped  semiconductor  (n++  —  InP)  on  one  side  of  an  undoped  CdS  region  that 
supplies  electrons  at  a  sufficient  rate  into  the  conduction  band  and  (2)  on  the  opposite  side, 
a  thin  semimetallic  film  that  facilitates  the  coherent  transport  (tunneling)  of  electrons  from 
the  semiconductor  conduction  band  into  vacuum.  Of  importance  is  the  mutual  alignment 
of  the  crystalline  energy  levels  at  the  semiconductor-semimetal  film  junction.  This  requires 
the  use  of  new  materials  and  development  of  their  epitaxial  growth  technologies.  For  that 
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reason,  the  choice  of  InP  as  a  substrate  is  particularly  attractive  since  the  lattice  constant 
of  InP  (5.S6  A)  closely  matches  the  lattice  constant  of  the  zincblende  cubic  CdS  (5.83  A). 
Furthermore,  there  have  been  recent  reports  on  the  deposition  of  crystalline  layers  of  CdS  on 
InP  by  molecular  beam  epitaxy  [7],  chemical  bath  deposition  [8],  and  pulsed  laser  deposition 
[9].  The  proposed  cold  cathode  should  therefore  be  realizable  with  present  day  technology. 

As  shown  in  Fig.  1(a),  a  thick  metal  grid  is  defined  on  the  surface  of  the  LaS  thin 
film  to  bias  the  structure.  There  are  openings  in  the  grid  structure  to  expose  the  thin  LaS 
film  which  forms  the  active  emission  area  of  the  cold  cathode.  Cathodes  with  rectangular 
(Fig. 1(b))  emission  windows  were  studied  previously  [5].  Current  crowding  and  self-heating 
effects  in  cathodes  with  circular  geometry  (Fig. 1(c))  emission  windows  will  be  considered 
hereafter.  The  bias  is  applied  between  the  back  metallic  contact  and  the  metal  grid  with 
emission  occuring  from  the  exposed  LaS  surface.  If  the  applied  voltage  is  equal  or  larger 
than  the  semiconductor  bandgap  energy  and  the  quotient  of  the  applied  voltage  divided  by 
the  semiconductor  thickness  approaches  O.leV’/A,  then  electrons  are  tunnel  injected  into  the 
conduction  band  and  ascend  during  their  travel  across  the  semiconductor  film  to  level*  of 
increasing  energy.  Referring  to  Fig.  2,  the  conduction  band  of  the  wide  bandgap  semicon¬ 
ductor  provides  the  launching  site  for  electrons  where  they  are  -  through  a  thin  film  -  injected 
into  vacuum.  This  injection  of  electrons  into  vacuum  becomes  possible  and  is  effective  as 
long  as  the  semimetallic  film  is  very  thin  and  has  a  work  function  small  enough  so  that  its 
vacuum  edge  is  located  energetically  below  the  conduction  band  edge  of  the  semiconductor. 
This  situation  is  referred  to  as  negative  electron  affinity  (NEA)  for  the  semiconductor  ma¬ 
terial  [10].  Depending  on  the  particular  materials  choices,  this  implies  that  the  semimetal 
work  function  <p\t  in  relation  to  the  semiconductor  energy  bandgap  Eg  must  obey  one  of  the 
inequalities  4>m  <  0.5Eg  or  <  Eg  if  an  intrinsic  or  p-type  doped  wide  bandgap  semi¬ 
conductor  is  used,  respectively.  A  negative  <Pm  implies  according  to  Fig.2  that  the  vacuum 
level  would  be  located  below  the  lower  conduction  band  edge.  In  that  case,  electrons  in 
the  conduction  band  with  momenta  pointing  toward  the  surface  have  a  good  chance  to  get 
emitted  unless  deflected  by  collision  or  trapped  by  impurities  or  defects. 

This  paper  is  organized  as  follows.  In  section  II,  we  derive  the  basic  equations  describing 
the  forward  bias  operation  of  the  cold  cathode  emitter  described  above.  We  then  calculate 
the  current  density- voltage  characteristics  of  the  newly  proposed  cold  cathode  for  specific  sets 
of  materials  and  device  parameters.  In  section  III,  we  investigate  the  importance  of  current 
crowding  effects  in  various  cold  cathodes  with  circular  geometry.  Our  analysis  includes  a 
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self-consistent  modeling  of  current  crowding  effects  and  an  analysis  of  power  dissipation  in 
the  cold  cathode  active  area.  The  influence  of  power  dissipation  on  self-heating  effects  in  the 
active  area  of  the  cathode  is  also  described.  Finally,  Sec.  IV  contains  our  conclusions. 


II.  THE  MODEL 

Hereafter,  we  analyze  the  cold  cathode  whose  energy  band  diagram  is  shown  in  Fig.  2 
[5].  Under  the  influence  of  a  large  electric  field  in  the  wide  bandgap  semiconductor,  electrons 
will  eventually  tunnel  from  the  left  contact  through  the  barrier  at  the  metal-semiconductor 
interface.  A  portion  of  the  current  emitted  at  the  metal  or  heavily  doped  semiconductor 
- CdS  contact  (which  we  model  assuming  Fowler- Nordheim  injection)  is  transmitted  at  the 
boundary  of  the  LaS  as  well  as  the  vacuum  boundary.  However,  a  fraction  of  the  current  is 
lost  in  the  thin  LaS  quantum  well  gives  rise  to  the  dynamic  shift  of  the  effective  material 
work  function  (Fig.  2).  For  a  cathode  operated  at  room  temperature,  we  model  this  internal 
field  emission  at  the  injection  junction  using  a  Fowler-Nordheim  (FN)  type  expression  for 
the  injected  current  (in  A/ cm2)  [11] 

JFN  =  Ci(£2/A)ec,AJ/,/£',  (1) 

where  C\  and  C2  are  constants  which  depend  on  the  wide  bandgap  semiconductor.  In  our 
numerical  simulations,  we  chose  C\  =  1.5AT06  A/V  and  Ci  —  6.9AT07( V^^crn)  1  which 
are  of  the  same  order  of  magnitude  as  the  constants  appearing  in  the  FN  expression  [11]. 
In  Eq.(l),  A  is  the  barrier  height  (in  eV)  at  the  metal-semiconductor  junction  and  E  is 
the  electric  field  (in  V/cm)  in  the  wide  bandgap  semiconductor  [12].  We  assume  that  the 
semiconductor  layer  thickness  is  such  that  the  transport  of  injected  electrons  is  close  to  being 
ballistic  up  to  the  interface  between  the  semiconductor  and  the  thin  semimetallic  film.  In  so 
doing,  we  also  neglect  carrier  ionization  processes  in  the  semiconductor  slab  which  could  be 
the  main  antagonist  to  ballistic  transport  in  that  region. 

Because  of  the  finite  probabilities  for  the  injected  current  to  be  transmitted  at  the 
semiconductor-semimetal  (probability  T\)  and  semimetal- vacuum  interfaces  (probability  Tj), 
the  contributions  to  the  total  emitted  current  can  be  calculated  as  the  sum  of  the  contribu¬ 
tions  resulting  from  the  mutiple  reflections  of  electrons  in  the  semimetallic  layer  (See  Fig.  2). 
The  magnitudes  of  the  emitted  current  components  decreases  with  the  number  of  multiple 
reflections  in  the  semimetallic  layer.  Rather  than  trying  to  calculate  these  contributions 
exactly,  we  assume  that  the  current  amplitude  is  decreased  by  a  factor  e  =  exp(  —  L?/ \l<is) 
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for  each  traversal  of  the  semimetallic  layer,  where  \us  is  the  collisional  mean  free  path  in 
the  semimetallic  layer  and  L2  is  the  length  of  the  semimetallic  layer.  Adding  the  contribu¬ 
tions  resulting  from  multiple  crossings  of  the  semimetallic  layers,  the  total  emitted  current 
is  found  to  be 

Jem  —  ^L\TiJq{  1  +  x  +  x2  +  ...),  (2) 

where  x  =  e2(l  —  T\)(l  -  Ti).  In  calculating  Jem  we  limited  the  number  of  traversals  of 
the  semimetallic  slab  to  five  to  include  the  fact  that  electrons  loose  energy  in  each  crossing 
and  eventually  do  not  have  enough  energy  to  surmount  the  barrier  at  the  semimetal- vacuum 
interface.  According  to  Eq.(2),  the  contributions  from  the  multiple  reflections  decrease 
rapidly  since,  in  general,  the  quantity  x  will  be  much  smaller  than  unity  [13].  Once  the 
emitted  current  is  found,  the  total  current  contributing  to  the  increase  in  the  sheet  carrier 
concentration  in  the  thin  semimetallic  film  can  easily  be  written  as  J^t  =  Jfn  —  Jem •  The 
total  trapped  current  is  then  given  by 


IT  =  AJcapt  =  ARJfn 


(3) 


where  A  is  the  area  of  each  LaS  emission  window  which  in  practice  can  be  either  rectangular 
or  circular  (See  Fig.  lb  and  lc).  In  Eq.(3),  R  is  the  trapping  coefficient  of  the  well 

R  =  1  -  tTiT2  (1  -I-  x  +  x2).  (4) 


The  semimetallic  thin  film  can  be  modeled  as  a  quantum  well  (Q.W)  which  will  loose  the 
trapped  electrons  essentially  at  its  lateral  boundaries.  In  reality,  Fig.  1(a)  indicates  that 
not  all  electrons  will  move  to  the  three-dimensional  contact  regions  surrounding  the  thin 
semimetallic  layer  but  many  of  them  will  get  reflected  at  the  lateral  thin  film  layer  with 
an  average  probability  r  (calculated  for  electrons  with  the  Fermi  velocity  in  the  thin  film). 
The  exiting  number  of  electrons  will  depend  on  the  thickness  of  the  semimetallic  layer  and 
could  be  adjusted  by  intentional  passivation  so  that  reflection  at  the  boundaries  of  the  thin 
semimetallic  film  could  be  tuned  from  almost  zero  to  nearly  unity.  Taking  into  account  the 
finite  reflection  amplitude  at  the  thin  film  boundaries,  the  leakage  current  of  the  Q.W  can 
be  rewritten 

cIQt 

=  : 2eLN2DvF{l  -  r),  (5) 

for  the  case  of  a  rectangular  emission  window  and 


JQt 

at 


ea.\2DvF{  1  -  r), 


(6) 
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for  the  case  of  a  circular  emission  window. 

In  Eqns.(5)  and  (6),  Qt  is  the  total  charge  captured  by  the  well,  e  is  the  magnitude  of 
the  electronic  charge,  N2d  is  the  excess  sheet  carrier  concentration  in  the  thin  film  due  to 
the  captured  electrons,  and  vF  is  the  Fermi  electron  velocity  in  the  semimetallic  thin  film. 
Under  steady  state  operation  of  the  cold  cathode,  the  excess  charge  in  the  two-dimensional 
semimetallic  film  is  found  using  Eq.(3)  and  imposing  the  current  balance  requirement  ^  = 
i?  —  AJcapt-  This  leads  to 

N2D  =  WJcapt/2e(l-r)vF,  (7) 

for  the  case  of  a  rectangular  geometry  and 

N2D  =  ajcapt/ 2e(l  -  r)vF ,  (8) 

for  the  case  of  a  circular  geometry. 

Simultaneously,  the  change  N2d  in  the  excess  sheet  carrier  concentration  in  the  Q.W 
due  to  trapped  electrons  leads  to  the  occupation  of  the  boundstate  energy  levels  according  to 
the  energy  density  of  states  up  to  an  energy  level  which  will  establish  the  dynamic  Fermi  level 
Efx.  The  Fermi  velocity  vF  entering  Eqns. (5)  and  (6)  must  be  calculated  self-consistently 
because  of  the  dynamic  work  function  shift  |Ax|  illustrated  in  Fig.  2.  This  dynamical  shift 
|Axl  is  equal  to  \EFX  -  £>°|,  where  EF°  is  the  Fermi  level  in  the  thin  semimetallic  layer 
under  zero  bias.  For  simplicity,  we  assume  that  the  electrons  in  the  conduction  band  of  the 
semimetallic  films  can  be  described  using  the  Sommerfeld  theory  of  metals  while  assuming 
s-band  conduction  in  the  semimetallic  thin  film  and  while  modeling  the  thin  film  using  the 
particle  in  a  box  model  for  the  quantum  well  [14].  The  set  of  equations  (1-8)  is  then  solved 
self-consistently  to  calculate  the  work  function  shift  |Ax|  &&  a  junction  of  the  externally 
applied  bias.  Once  the  dynamic  shift  has  been  determined  self-consistently,  Eq.(2)  can  then 
be  used  to  determine  the  emitted  current. 

RESULTS 

We  consider  a  specific  structure  with  the  material  and  structural  parameters  listed  in 
Table  I  and  II,  respectively.  Both  Au  and  Ag  are  known  to  form  contacts  to  thin  films  of 
semiconducting  (n-tvpe)  CdS.  In  that  case,  the  barrier  height  A  shown  in  Fig.  2  is  quite 
small  and  is  equal  to  0.7S  eV  and  0.56  eV  for  the  case  of  Au  and  Ag  contacts,  respectively 
[11].  The  lattice  constant  of  CdS  (5.S3.4)  is  very  close  to  the  lattice  constant  of  the  thin 
semimetallic  surface  layer  LaS  (5. So  A)  which  in  its  cubic  crystalline  structure  will  therefore 
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be  lattice  matched  to  the  semiconducting  material.  Additionally,  LaS  is  expected  to  have 
quite  a  low  room  temperature  work  function  (1.14  eV)  [15],  a  feature  when  combined  with 
the  large  energy  gap  (2.5  eV)  of  C'dS  leads  to  NEA  of  the  semiconductor  material.  In  the 
following  numerical  simulations,  the  thicknesses  of  the  CdS  (L\)  and  LaS  (Li)  layers  are  set 
equal  to  500  A  and  24.6  A  (4  monolayers),  respectively.  We  model  a  cathode  with  a  square 
(W  =  L)  emission  window  with  a  1  cm 2  area. 

Figure  3  is  a  plot  of  the  dynamic  work  function  shift  as  a  function  of  applied  bias  for 
the  cold  cathode  emitter  with  both  Au  and  Ag  injecting  contacts.  The  following  parameters 
were  used:  A  us  =  300  A,  7\  =  Ti  =  0.5,  and  vp  =  1.36X108  cm/s.  Figure  3  indicates  that 
the  dynamic  shift  of  the  LaS  work  function  is  sensitive  to  the  quality  of  the  interface  between 
the  two-dimensional  semimetallic  layer  and  the  three-dimensional  contacts  which  we  model 
by  varying  the  reflection  coefficient  r  between  the  two-dimensional  semimetallic  thin  film 
and  the  three-dimensional  contact  regions  (See  Fig.  1(a)).  It  should  be  noticed  that  the  LaS 
work  function  shift  can  approach  the  LaS  workfunction  even  for  the  case  of  a  leaky  interface 
between  the  thin  semimetallic  layer  and  the  3D  contact  regions.  The  dynamic  shift  |Ax|  is 
comparable  to  the  work  function  of  LaS  for  a  smaller  value  of  the  applied  bias  in  the  case  of 
Ag  contact  because  of  the  lower  barrier  at  the  Ag/CdS  interface. 

Figure  4  compares  the  emitted  current  densities  Jem  for  the  structure  with  Au  and  Ag 
contacts  calculated  while  including  or  neglecting  the  effects  of  the  dynamic  shift  of  the  LaS 
work  function.  The  current  density  versus  bias  plots  are  stopped  at  the  values  of  V L  at 
which  |Ay|  =  <Psi(LaS )  =  l.lieV.  Beyond  that  point,  the  theory  exposed  here  is  no  longer 
valid  since  we  would  need  to  include  the  spill  over  of  the  excess  trapped  carriers  into  vacuum. 
As  can  be  seen  in  Fig.  4,  the  emitted  current  densities  can  be  more  than  a  factor  two  larger 
when  the  effects  of  the  dynamic  shift  of  the  work  function  of  the  semimetal  are  included. 
The  effects  could  be  made  more  drastic  if  a  set  of  materials  and  device  parameters  could  be 
found  for  which  the  dynamic  shift  of  the  work  function  could  be  made  comparable  to  the 
work  function  itself  at  fairly  low  value  of  the  applied  bias  (<  5V). 

Sensitivity  of  Dynamic  Work  Function  Shift  on  Design  Parameters 

The  previous  numerical  examples  have  shown  that,  under  forward  bias  operation,  the 
electrons  captured  in  the  low  work  function  material  are  responsible  for  an  effective  reduction 
of  the  semimetallic  film  work  function  together  with  a  substantial  increase  of  the  cathode 
emitted  current.  This  dynamic  work  function  shift  was  shown  to  increase  with  the  amount  of 
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injected  current.  Hereafter,  we  perform  a  more  extensive  study  of  the  dynamic  work  function 
shift  which  includes  variations  of  the  length  of  the  CdS  region  (L i),  the  electron  mean  free 
path  in  the  LaS  region  (A^),  the  emission  window  size  (W),  the  transmission  coefficients 
at  the  CdS / LaS  (T\)  and  LaS  fV acuum  interfaces  (T2),  and  the  reflection  at  the  2D/3D 
interface  region  in  the  LaS  quantum  well  (r). 

Figure  3  indicates  that  the  dynamic  work  function  shift  |A*|  rises  exponentially  above 
a  threshold  voltage  of  several  volts  and  reaches  rapidly  (within  a  few  volts  range)  a  value 
comparable  to  the  LaS  work  function.  For  a  structure  with  the  parameters  listed  in  Table  I 
and  with  the  structural  and  physical  parameters  (L\  =  500  A,  Li  =  24.6A,  W  =  1cm,  A  us  = 
300A,  Ti  =  Ti  =  0.5,  A(A3)  =  0.56eV  ),  Fig.  5(a)  shows  that  the  dynamic  work  function 
shift  rises  exponentially  at  a  lower  bias  as  the  reflection  coefficient  at  the  2D/3D  interface 
in  the  LaS  region  approaches  unity.  Figure  5(b)  also  shows  that  the  difference  between 
the  current  densities  calculated  with  and  without  including  the  dynamic  work  function  shift 
are  more  pronounced  for  smaller  values  of  the  applied  bias  when  the  reflection  coefficient 
between  the  2D  and  3D  LaS  regions  is  approaching  unity.  This  results  from  the  fact  that 
any  mechanism  (like  r  being  closer  to  unity)  which  increases  the  amount  of  charge  being 
trapped  in  the  LaS  quantum  well  leads  to  an  enhancement  of  the  dynamic  work  function 
shift  at  a  given  bias.  For  instance,  all  other  cathode  parameters  being  equal,  the  dynamic 
work  function  rises  much  faster  as  a  function  of  applied  bias  in  structures  with  thinner  CdS 
regions  (Fig.  6),  with  smaller  values  of  the  transmission  coefficients  Ti  and  T2  (Fig.  7),  or 
with  smaller  mean  free  path  (A LaS)  in  the  semimetallic  thin  film  (Fig.  8).  As  shown  in  Fig. 
8(a),  the  dynamic  work  function  shift  occurs  at  a  lower  bias  as  Xus  is  decreased.  On  the 
other  hand,  the  emitted  current  density  is  lesser  at  a  given  bias  in  a  cathode  whose  LaS  thin 
film  has  a  lower  electron  mean  free  path  (Fig.  8(b)).  We  have  found  this  trend  to  be  valid 
for  ail  values  of  the  reflection  coefficient  at  the  LaS  2D/3D  interface.  However,  there  is  a 
larger  spread  in  the  family  of  curves  representing  the  bias  dependence  of  the  dynamic  work 
function  and  emitted  current  densities  as  a  function  of  the  mean  free  path  in  the  LaS  thin 
film  for  smaller  values  of  the  reflection  coefficient  r.  Finally,  even  though  not  shown  here, 
the  exponential  rise  of  the  dynamic  work  function  shift  and  emitted  current  density  has  been 
shown  to  occur  at  lower  value  of  the  bias  by  either  reducing  the  thickness  of  the  CdS  layer 
or  by  lowering  the  barrier  height  A  at  the  metal/ CdS  interface. 

Before  leaving  this  section,  we  make  two  additional  remarks  on  the  bias  dependence 
of  the  emitted  current  density  and  the  dynamic  work  function  shift  which  we  have  checked 
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numerically  on  all  the  cold  cathodes  modelled  in  this  work.  First,  since  the  emission  of 
electrons  at  the  metal(or  heavily  semiconductor)/CdS  interface  is  assumed  to  be  of  the 
Fowler-Nordheim  type,  the  emission  current  is  expected  to  have  a  bias  dependence  of  the 
form 

Jfn  =  AV^Sexpi-B/Vna*)-  (9) 

This  was  checked  numerically  for  all  the  cold  cathodes  modelled  here.  A  typical  example  is 
given  in  Fig. 9(a),  with  the  values  of  the  parameters  A  and  B  listed  in  the  inset.  Most  of 
the  quantities  of  interest  to  be  determined  hereafter  (power  dissipation,  lateral  variation  of 
emitted  current  and  lateral  potential  drop,...)  can  be  calculated  exactly  analytically  if  the 
following  approximation  is  used 

Jfn  =  JoeaVfc*“'.  (10) 

For  all  the  cold  cathodes  simulated  here,  we  have  shown  that  Eq.(10)  gives  a  fairly  accurate 
fit  to  the  plot  of  the  emitted  current  density  versus  applied  bias  if  the  range  of  the  fit  is 
restricted  to  current  densities  between  1  and  1000  Ajcmr  .  A  typical  fit  for  the  one  of  the 
cold  cathode  studied  here  is  shown  in  Fig. 9(b)  with  the  values  of  the  parameters  Jq  and  a 
in  Eq.(8)  shown  in  the  inset.  Table  III  gives  a  summary  of  the  parameters  Jo  and  a  for  cold 
cathodes  of  different  width  and  with  the  physical  parameters  listed  in  the  caption  of  Fig.5. 
We  have  found  that  the  bias  dependence  of  the  dynamic  work  function  shift  is  also  of  the 
Fowler-Nordheim  type,  i.e, 


Ax  =  %Viia,2exp(-V( o/Vtoat).  (11) 

This  is  illustrated  in  Fig.  10  for  the  cold  cathode  with  the  same  parameters  a s  in  Fig. 9.  The 
values  of  A0  and  Vo  are  indicated  in  the  inset  of  Fig.  10.  We  point  out  that  the  values  of  the 
parameters  B  and  V0  in  Eqns.(9)  and  (10)  are  nearly  identical. 

Temperature  Rise  in  the  Cold  Cathode 

Hereafter,  we  derive  an  upper  estimate  of  the  temperature  rise  in  the  LaS  thin  film  as  a 
result  of  the  power  dissipation  mechanisms  discussed  above.  We  focuss  on  a  cold  cathode 
where  the  CdS  thin  film  is  deposited  on  a  InP  substrate  as  shown  in  Fig.l(a).  The  successful 
growth  of  cubic  CdS  thin  films  with  good  crystalline  quality  on  InP  substrates  has  been 
reported  recently  by  Shen  and  Kwok  [9j.  For  the  case  of  a  InP/CdS  interface,  there  has  not 
been  any  report  of  the  conduction  band  discontinuity  tlEc  at  the  interface  between  the  two 
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materials,  to  the  best  of  our  knowledge.  For  that  reason,  AEe  was  assumed  to  be  given  by 
Anderson’s  rule,  i.e,  A  in  Fig.  2  is  assumed  to  be  given  by  |*(/nP)  -  *(CdS)|  =  0-2  eV 
[16].  This  estimate  was  based  on  the  measured  electron  affinities  of  InP  (4.4  eV)  and  CdS 
(4.2  eV)  reported  in  refs.  [17]  and  [IS],  respectively.  The  back  contact  to  the  substrate  is 
assumed  to  be  perfectly  ohmic  and  to  act  as  a  perfect  heat  sink  (300I<).  Since  the  device 
area  (heat  source  formed  of  the  LaS  thin  fim)  is  much  thinner  than  the  substrate,  it  is 
necessary  to  consider  the  effect  of  heat  spreading  laterally  in  the  substrate.  Furthermore, 
we  assume  that  the  temperature  of  the  CdS  layer  will  be  the  same  as  the  LaS  top  layer. 
The  active  area  of  the  cathode  (CdS  and  LaS  layers)  is  therefore  assumed  to  be  acting  as  a 
heat  source  with  the  power  density  calculated  in  the  previous  section.  Because  of  the  finite 
thermal  conductivity  of  the  substrate,  we  expect  self-heating  effects  to  affect  the  operation 
of  the  cold  cathode  if  the  power  level  dissipated  in  the  active  area  of  the  cathode  becomes 
too  important.  Hereafter,  we  model  the  thermal  conductivity  of  the  InP  substrate  as  follows 

k(T)  =  MF/To)-6,  (12) 


where  k0  is  the  thermal  conductivity  at  T0  (300K),  /c0  =  0.74  W/Kcm  is  the  room  temperature 
thermal  conductivity  of  InP  and  b=1.45.  Starting  with  Fick’s  law  and  making  use  of  a 
Kirchoff  transformation  to  take  into  account  the  temperature  dependence  of  the  thermal 
conductivity  of  the  InP  substrate  given  by  Eq.(l2),  it  can  be  shown  that  the  active  area  of 
the  cold  cathode  will  be  operated  at  a  temperature  given  by 

RtkiaPdi.,]^  nil 

— jpi -  ’  ^ 

io  J 

where  Tq  is  the  ambient  room  temperature  (assumed  to  be  300  K  hereafter),  Pa„  is  the  total 
power  dissipated  per  finger  as  calculated  in  the  previous  section,  and 


* 

Kq  Jo  A{z) 


(14) 


where  Zs  is  the  thickness  of  the  InP  substrate.  Our  estimate  of  the  temperature  rise  in  the 
active  cold  cathode  area  will  give  an  upper  estimate  of  the  temperature  of  operation  since 
we  neglected  heat  conduction  to  the  top  Au  contacts  in  the  thick  portion  of  the  LaS  thin 
film. 
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III.  Current  Crowding  Effects  in  Proposed  Cold 
Cathode  with  a  Circular  Geometry 


In  this  section,  we  study  the  effects  of  current  crowding  in  the  case  of  an  emission  window 
with  circular  geometry  (see  Fig. 1(c)).  The  lateral  potential  drop  in  the  circular  LaS  window 
satisfies  the  following  differential  equation  [19] 

£  =  7 i(r)'  <I5> 

where  i(r)  is  the  total  lateral  current  per  unit  length  flowing  outward  across  a  circle  of  radius 
r,  whose  center  coincide  with  the  center  of  the  emission  window.  The  lateral  current  satisfies 
the  following  equation 

2  jrri(r)  =  2nR  [  dr'r  j(r).  (16) 

Jo 

If  we  further  assume  that  the  Fowler-Nordheim  emitted  current  j(r)  can  be  approximated 
by  Eq.(10)  over  the  range  of  dc  bias  considered  here,  the  following  second-order  differential 
equation  must  be  satisfied  by  the  lateral  potential  drop: 

fh  +  ~  =  ££*?VM/Vr.  (17) 

dr 2  r  dr  t 

This  differential  equation  must  be  solved  subject  to  the  following  boundary  conditions  valid 
for  the  circular  geometry 


at  r  =  0,  and 


V(r  =  a)  = 


at  the  edge  of  the  circular  window.  Introducing  the  reduced  variable  r  =  r /a  and  the 
quantity 

r  =  (M) 


Eq.(17)  can  be  recast  as  follows 


ePY  ,  1  dY  n  2  y 

J  ,2  "**  I  j  '  —  “7  ®  > 

dr  r  dr 


where  ~/2  =  and  the  parameter  3  is  identical  to  the  one  defined  for  the  planar  problem 

1201. 
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The  general  solution  of  Eq.(21)  can  be  found  analytically  and  is  given  by 

n\ -  cr'‘)X  (22) 

where  8  and  c  are  constants  to  be  determined  so  the  boundary  conditions  (18)  and  (19)  are 
satisfied.  Using  the  new  system  of  variables,  Eq.(lS)  becomes 


while  Eq.(19)  now  reads 


Equation  (23)  leads  to 


and  Equation  (24)  becomes 


U(1)  =  0. 


8  =  2. 


*«(l-c)V/c. 


Combining  these  last  two  equations,  we  obtain  the  following  result 

=  (i^r  (27) 

Equation  (27)  can  be  solved  exactly  for  the  parameter  c 

C  =  !tj  +  2  -  ’yV  +  1]/T*  (28) 

in  terms  of  which  we  can  write  various  quantities  of  interest,  including  the  ratio 

J(0)/J(a)  =  [1  -  c]2,  (29) 

characterizing  the  importance  of  current  crowding  in  the  circular  geometry.  Using  Eqn.(22), 
the  radial  dependence  of  the  lateral  potential  drop  if  found  to  be 


V(r)  =  Yu.,  -  2 VTln{ 


a 2  —  cr 2 


v  V  )  —  ***•  ~  _C)J’  V  7 

from  which  the  maximum  value  of  the  in  plane  electric  field  is  found  to  be 

Er{r  =  a)  =  (31) 

v  c  —  1  a 

As  in  the  case  of  the  rectangular  geometry,  the  total  power  dissipated  in  the  LaS  thin 
film  is  given  by  the  sum  of  the  following  four  contributions  [20].  The  power  dissipated  by 
the  electrons  being  trapped  in  the  LaS  circular  thin  film  is  given  by 


Pi  =  2: ~R  [  drj(r)V(r). 
Jo 
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The  power  dissipated  by  Joule  heating  as  trapped  electrons  move  to  the  edge  of  the  LaS 
window  is  given  by 

n  r  drr  r  '  ■/  .  '12 


J  -r(  [  r 'i(r')<fr']2. 
j  o  r  J  o 


The  third  contribution  to  power  dissipation  comes  from  from  Joule  heating  linked  to  the 
current  making  it  from  the  LaS  thin  film  to  the  Au  contacts  on  top  of  the  thick  LaS 


regions. 


P3  =  Rc(2irai(a))2. 


where  is  the  resistance  of  the  LaS  region  between  the  edge  of  the  LaS  thin  film  and  the 
top  Au  layer.  This  resistance  can  be  estimated  as  follows  [21] 


Rc  = 


psH  Ink 
t2ira  k  —  1  ’ 


where  k  =  b/t,  a  is  the  radius  of  the  circular  window,  and  H  is  the  height  of  the  thick  LaS 
region. 

Finally,  there  is  also  a  contribution  to  power  dissipation  due  to  the  blocking  effect  on 
the  Folwer-Nordheim  emission  current  emitted  under  the  wide  LaS  contacts: 


Pa  =  vb(b+2a)J0eaV*".  (36) 

Starting  with  Eqns.(lO)  and  (30),  the  different  contributions  to  the  power  dissipation 
can  be  calculated  exactly  and  are  found  to  be 

Px  =  7ra2JoReaV^-Vbm3(l  -  c)  -  ira2 J0Re*v"" VT[2{1  -  c)  +  (1  ~  C)2/n(l  -  c)2],  (37) 

c 

P2  =  P(Ta2J0eaVHVr(2(l  -  c)  +  -(1.— )2/n(l  -  c)2],  (38) 

c 

Pz  is  given  by  Eq.(34)  and  P4  is  found  to  be 

_  PcP2  2 

4  ~  2(1  -  P)2iem  ’ 

where  I m  is  the  total  emitted  current  through  the  circular  window 

Iem  =  2rr(l  -R)J  ri(r)dr ■  (40) 

The  latter  can  be  calculated  explicitely  and  is  found  to  be 

hm  =  (1  -  P)[-aV0ealH(  1  ~  c).  (41) 
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The  input  power  (per  emission  window)  delivered  by  the  power  supply  biasing  the  cold 
cathode  is  given  by 

P mput  =  /  2  irrj{r)V(r)dr,  (42) 

J  0 

which  can  readily  be  shown  to  be  given  by 

P,n„t  =  Pi/R  +  P<-  (43) 


The  power  efficiency  of  the  cold  cathode  can  be  calcualted  as  follows 

Ptnput  Pditt 


T]p  = 


input 


(44) 


and  the  temperature  rise  in  the  cathode  is  given  by  Eq.(13).  where  the  thermal  resistance 
Rth  o  must  be  calculated  for  the  case  of  power  dissipation  through  the  substrate  from  a  heat 
source  with  circular  geometry.  In  this  case,  we  find 

rrai'  («> 

Ko  a  [a  +  z,tanO\ 

in  which  the  heat  spreading  angle  9  is  set  equal  to  45°  in  the  numerical  examples  below,  for 
simplicity. 


Numerical  Examples 

Figure  11  shows  the  variation  of  the  parameter  c  in  Eq.(28)  as  a  function  of  applied 
bias  for  cold  cathodes  with  circular  emission  window  of  different  radii.  For  ail  cathodes, 
the  physical  parameters  are  the  same  as  listed  in  the  caption  of  Fig. 5.  As  in  the  case  of  a 
rectangular  window,  we  use  the  criterion  that  current  crowding  is  negligible  if  the  lateral 
potential  drop  between  the  center  and  the  edge  of  the  LaS  circular  window,  V(r  =  0)  —  Hia3, 
is  kept  less  than  0.1  VT.  Using  Eqns.(28)  and  (30),  we  find  that  this  criterion  requires  the 
parameter  c  to  be  less  than  0.05.  This  limit  is  indicated  as  a  vertical  line  in  Fig.  11.  The 
family  of  curves  in  Fig.  11  is  parametrized  with  the  radius  of  the  emission  window.  Figure  11 
shows  that  the  range  of  dc  bias  over  which  current  crowding  can  be  neglected  in  a  circular 
window  is  comparable  to  the  range  of  dc  bias  over  which  current  crowding  is  negligible  in  a 
rectangular  window  whose  width  is  equal  to  the  radius  of  the  circular  emission  window  [20]. 

Figure  12  illustrates  the  importance  of  current  crowding  on  the  lateral  potential  drop 
in  emitter  windows  of  different  radii.  The  left  frames  show  the  radial  dependence  of  the 
electrostatic  potential  for  four  different  values  (1,10,100,1000  A/cm2)  of  the  current  density  at 
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the  rim  of  the  circular  LaS  window.  The  right  frames  in  Fig.  12  show  the  corresponding  radial 
dependence  of  the  emitted  current  density.  From  Fig.12,  it  can  be  seen  that  current  crowding 
is  negligible  in  emitter  windows  with  radius  less  than  50  pm  if  the  emitted  current  density 
is  kept  under  10  A/cm2.  As  in  the  case  of  LaS  windows  with  rectangular  geometry,  Figure 
12  shows  that  the  current  density  profiles  are  much  more  sensitive  to  the  finite  resistivity  of 
the  LaS  thin  film  than  the  lateral  potential  drop. 

Figure  13  is  a  plot  of  the  four  contributions  to  the  total  power  dissipated  in  cold 
cathodes  with  different  radii  plotted  as  a  function  of  the  parameter  c.  For  all  cathodes,  the 
power  dissipation  due  to  Joule  heating  in  the  LaS  thin  film  and  the  thick  LaS  regions  is 
negligible  compare  to  the  power  released  by  electrons  being  trapped  in  the  LaS  thin  film  and 
by  electrons  blocked  in  the  thick  LaS  regions.  The  latter  is  always  about  one  of  magnitude 
higher  than  the  former.  Figure  13  shows  that  substantial  power  dissipation  occurs  in  the 
cathode  with  radius  under  50^m  while  the  cathode  is  still  operating  without  any  substantial 
current  crowding  effects  (i.e,  c  <  0.05).  The  power  efficiency  rjp  of  cathodes  of  different 
width  is  plotted  as  a  function  of  Hia,  and  the  emitted  current  density  Jm  in  Figs.  14(a) 
and  14(b),  respectively.  For  all  window  size,  the  efficiency  decreases  with  Hia,  and  Jm  as  a 
result  of  current  crowding.  The  efficiency  is  more  or  less  constant  over  a  wider  range  of  Ha, 
for  window  with  smaller  radius  because  current  crowding  is  less  important  in  that  case,  as 
illustrated  in  Fig.  12.  The  overall  lower  efficiency  for  window  with  smaller  radius  illustrated 
in  Fig.  14  comes  from  the  fact  that  the  width  of  the  thick  LaS  regions  was  set  equal  to 
lOO^m  for  all  cathodes.  The  efficiency  of  cathodes  could  be  increased  by  milking  the  ratio 
b/a  in  Fig.  1(b)  closer  to  unity. 

Figure  15  shows  the  temperature  of  the  active  area  of  a  cold  cathode  with  the  param¬ 
eters  listed  in  Table  I  as  a  function  of  Ha,  for  emitter  window  with  different  radii.  The 
thickness  of  the  LaS  contacts  and  InP  substrate  was  set  equal  to  100A  and  100/im,  respec¬ 
tively.  As  in  case  of  cold  cathodes  whose  emission  window  as  a  rectangular  geometry  [20], 
Fig.  15  indicates  that  to  limit  the  temperature  rise  in  any  cathode  to  less  than  200  K,  the 
dc  bias  must  be  limited  to  a  smaller  range  for  emitters  with  smaller  window  radius.  For 
instance,  according  to  Figures  11  and  15,  a  cathode  with  a  20/jm  diameter  can  be  operated 
up  to  8.3  V  with  negligible  self-heating  effects  (AT  around  100  K  ).  For  that  bias,  Figure  12 
indicates  that  current  crowding  would  be  negligible  in  the  cathode  and  the  emitted  current 
density  would  be  around  100  A /cm2  (See  Fig.  16).  On  the  other  hand,  Figure  11  indicates 
that  a  100 fim  diameter  window  can  be  operated  up  to  6.8  V  before  current  crowding  becomes 
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non  negligible.  At  this  bias,  the  emitted  current  density  would  be  around  15  A  /cm2  (Fig. 
16)  while  the  temperature  rise  in  the  device  would  be  only  about  15  K  as  shown  in  Fig.  15. 

IV.  CONCLUSIONS 

We  have  proposed  a  new  cold  cathode  emitter  which  consists  of  a  thin  wide  bandgap  semi¬ 
conductor  material  sandwiched  between  a  metallic  material  or  heavily  doped  semiconductor, 
and  a  low  work  function  semimetallic  thin  film.  We  have  shown  that  the  capture  of  electrons 
by  thin  semimetallic  layers  grown  on  the  escape  surface  of  wide  bandgap  semiconductors  can 
lead  to  a  dynamical  shift  of  the  work  function  of  the  semimetallic  layers  together  with  an 
increase  of  the  cathode  emission  current.  While  varying  the  device  and  physical  parameters 
of  the  structure,  our  studies  suggest  that  any  mechanism  which  promotes  additional  charge 
deposit  in  the  well  enhances  the  dynamic  work  function  shift  phenomenon  thereby  increasing 
the  emitted  current.  Potential  material  candidates  were  proposed  for  cold  cathode  operation 
with  applied  bias  under  10  V,  with  current  densities  approaching  several  tens  of  A/cm2,  and 
with  large  power  efficiencies  ( tjp  approaching  15  %). 

The  results  of  our  analysis  show  that  a  cold  cathode  with  either  a  rectangular  or  circular 
emission  window  and  with  the  parameters  listed  in  Tables  I  and  II  would  emit  a  uniform 
current  density  of  about  15 A/ cm2  at  a  dc  biasing  voltage  of  about  8V.  For  that  bias,  the 
effects  of  current  crowding  would  be  negligible  and  the  temperature  rise  in  the  active  area 
of  the  cathode  (CdS/LaS  layers)  as  a  result  of  self-heating  effects  would  be  negligible. 

Further  improvements  to  the  theory  should  include  a  more  realistic  model  for  the  emis¬ 
sion  current  and  transport  through  the  wide  bandgap  material.  Also,  the  semimetallic  film 
energy  density  of  states  (to  account  for  the  d-band  character  of  the  conduction  band  in  the 
chosen  rare-earth  semimetallic  samples  [14]),  the  finite  probability  for  electron  wavefunctions 
in  the  thin  semimetallic  films  to  extend  in  the  semiconductor  material  [22],  a  more  accurate 
description  of  the  energy  loss  mechanisms  [23]  and  screening  effects  (including  the  lateral 
ohmic  voltage  drop)  in  thin  semimetallic  layers  [24].  Finally,  our  study  of  self-heating  effects 
should  include  partial  cooling  of  the  cathode  due  to  heat  conduction  through  the  thick  LaS 
layers  which  was  neglected  in  this  study.  The  latter  would  allow  to  extend  slightly  the  dc 
biasing  operating  range  of  the  cathode  beyond  the  estimate  reported  here.  Furthermore,  the 
thermal  and  electrical  models  of  the  cathode  described  here  should  be  solved  self-consistently. 
Once  all  these  effects  are  taken  into  account,  we  believe  the  quantitative  operation  of  the 
cold  cathode  exposed  here  will  stay  essentially  correct  predicting  a  dynamical  shift  of  the 
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work  function  of  the  thin  semimetailic  film  of  the  same  order  of  magnitude  than  the  one 
reported  here. 

Our  analysis  provides  the  basic  design  rules  to  fabricate  a  new  cold  cathode  with 
emission  windows  with  a  rectanguler  or  circular  geometry.  The  growth  of  the  structure 
would  require  the  epitaxial  growth  of  the  structure  shown  in  Fig.l.  As  discussed  above,  the 
epitaxial  growth  of  InP/CdS  heterostructures  has  been  reported  in  the  literature  in  the 
past  [9].  The  deposition  of  epitaxial  LaS  thin  films  has  not  been  reported,  to  the  best  of 
our  knowledge.  We  believe,  however,  that  the  figures  of  merits  of  the  various  cold  cathodes 
analyzed  in  this  work  are  a  strong  incentive  towards  the  experimental  investigation  of  these 
devices.  If  successful,  such  an  experimental  effort  would  lead  to  big  pay-offs  with  the  design 
of  highly  efficient  cold  cathodes  for  large  panel  displays,  IR  image  convertors  and  sensors,  and 
active  power  devices  in  mobile  and  airborne  electronic  equipment  for  military,  commercial, 
and  private  use. 
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Table  I:  Material  Parameters  of  the  Cold  Cathode 


Material 

Au  (Ag) 

n-n-  -InP 

i-CdS 

LaS 

Lattice  Thickness  (A) 

optional 

optional 

300.0 

24.6 

Lattice  Constant  (A) 

4.04  (4.09) 

5.86 

5.83 

5.85 

Workfunction  (eV) 

4.3  (4.3) 

4.4 

4.2 

1.14 

Bandgap  (eV) 

1.42 

2.5 

#  of  free  electrons  (1022 cm-3) 

n++ 

5.9  (5.86) 

— 

1.99 

Electron  Mass  (mo) 

1.0 

0.0765 

0.14 

1.0 

Electron  Mobility  (cm2V-1s-1) 

5370. 

400.0 

Thermal  conductivity  @  300K  (W/cmK) 

3.1  (4.18) 

0.74 

0.05..1 

0.17 

Electrical  resistivity  (273K)  (/xf \cm) 

1.51  (2.04) 

92.0 

Melting  temoerature  (K) 

1335. 

2500 

Table  II:  Physical  Parameters  of  the  Cold  Cathode 


Thickness  of  InP  substrate  50  -  200  /xm 

Thickness  of  CdS  thin  film  300  -  500  A 

Emission  window  length  1  cm 

Thickness  of  LaS  thin  film  24.6  A 

Thickness  of  LaS  thick  regions  100  -  500  A 


Electron  mean  free  path  in  LaS  thin  film  50  -  300  A 
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Table  III:  Parameters  for  numerical  fit  (Eq.(10))  to  the  Fowler- Nordheim  current 
expression  (Eq.(9)).  The  fit  is  made  over  a  current  density  range  from  1  to  1000  A/cm2. 


Width  of  Rectangular  Window  (/im) 

Jo  (A/cm2) 

a  (V») 

10,000 

7.21S6X10"7 

2.3651 

1,000 

3.6483X10'6 

2.1302 

500 

3.8505X10"6 

2.1224 

200 

4.7337X10-6 

2.0933 

100 

4.7902X10-6 

2.0916 

50 

4.8207X10"6 

2.0907 

20 

4.8424X10-6 

2.0901 

10 

4.8424X10-6 

2.0901 
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Figure  1:  (a)  Left:  cross-section  of  the  newly  proposed  cold  cathode  between  two  emitter  fingers.  Trapping 
of  electrons  by  the  LaS  semimetallic  thin  film  leads  to  a  lateral  current  flow  and  current  crowding  in  the 
structure.  Right:  illustration  of  the  partial  reflection  of  the  two-dimensional  electron  gas  in  the  LaS  thin 
film  upon  entering  the  three-dimensional  contact  regions  where  the  external  bias  is  applied  to  Au  contacts 
made  to  the  thick  LaS  regions,  (b)  Illustration  of  the  multiple  finger  metallic  structure  used  to  bias 
appropriately  a  cold  cathode  with  rectangular  emission  window. 


(C) 

Figure  1  (cont’d):  (c)  Array  of  circular  cold  cathode  emitters  arranged  in  a  honeycomb  configuration  and 
close-up  view  of  one  of  the  emitters  showing  the  various  layers  in  the  epitaxially  grown  structure  and  the 
contacts  made  to  the  LaS  metallic  grid  and  the  n++  —  InP  heavily  doped  injection  layer.  (After  W.  Friz, 
Task  ELM-2,  "Dissipative  Processes  in  Veiled  Work  Function  Emitters”,  Contract  F  33615-C-95-1755, 
WPAFB,  Dayton,  May  1996]. 
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Figure  2:  Schematic  representation  of  the  conduction  band  profile  throughout  the  cold  cathode  emitter 
described  in  the  text.  Under  forward  bias,  a  fraction  of  the  emitted  current  is  captured  in  the  semimetallic 
slab.  The  subsequent  excess  sheet  carrier  concentration  in  the  quantum  well  formed  by  the  semimetallic 
slab  leads  to  a  shift  of  the  fermi  level  in  the  thin  film  which  is  similar  to  a  lowering  of  the  work  function  of 
the  thin  film.  For  a  given  forward  bias,  this  leads  to  an  increase  in  the  electric  field  in  the  wide  bandgap 
semiconductor  (dashed  line  versus  full  line)  and  in  an  increase  in  the  injection  and  emitted  currents.  Also 
shown  in  the  quasi  Fermi  level  spatial  dependence  across  the  wide  bandgap  semiconductor. 
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Figure  3:  Dynamical  shift  of  the  work  function  as  a  function  of  the  external  applied  bias  for  a  cold 
cathode  emitter  with  the  parameters  listed  in  Table  I.  For  each  group  of  curves,  r  =  0.99,  0.9,  and  0.75, 
from  left  to  right.  The  following  physical  parameters  were  used:  A  Las  =  300  A,  7\  =  T2  =  0.5,  and 
Vf  =  1.36X10®  cm/s. 


Figure  4:  Comparison  between  the  emitted  current  as  a  function  of  the  applied  bias  while  including  (dashed 
line)  and  neglecting  (full  line)  the  dynamical  shift  of  the  work  function  of  the  semimetallic  slab  described 
in  the  text.  The  parameters  of  the  device  are  listed  in  Table  I.  The  coefficient  C\  and  C2  in  Eq.(l)  were 
chosen  equal  to  1.5X106  A/V  and  6.9X107  (Vl/2cm)~l,  respectively.  Other  choices  for  the  parameters 
Ci  and  C2  with  similar  magnitudes  lead  to  similar  dynamical  shift  of  the  work  function.  The  reflection 
coefficient  r  shown  in  Fig.  1(a)  was  set  equal  to  0.99.  The  following  parameters  were  used:  XtaS  =  300  A,  ! 
T\  =  To  =  0.5,  and  v f  =  1.36XlOa  cm/s. 
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(a) 


(b) 

Figure  5:  (a)  Bias  dependence  of  the  dynamic  work  function  shift  in  a  typical  cold  cathode  while  varying 
the  reflection  coefficient  of  electrons  between  the  2D  thin  film  and  LaS  contact  regions,  (b)  Correspondin 
bias  dependence  of  emitted  current  density.  The  following  parameters  were  used  (L\  =  500  A,  Li  =  24. 
A,  W  =  1  cm,  Ti  =  To  =  0.5,  XLaS  =  300  A,  A(.4a)  =  0.56eV  ). 
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bo  <r) 


(b) 

Figure  6:  Same  as  Figure  5  for  a  cathode  with  a  300  A  thick  CdS  region,  all  other  parameters  being  kept 
the  same.  Comparison  with  Figure  5  shows  that  the  exponential  rise  of  the  dynamic  work  function  shift 
and  the  emitted  current  density  occurs  at  smaller  values  of  the  applied  bias. 
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(a) 


(b) 

Figure  7:  (a)  Bias  dependence  of  the  dynamic  work  function  shift  in  a  typical  cold  cathode  while  varying  the 
transmission  coefficients  7\  and  T2  at  the  CdS / LaS  and  XaS/vacuum  interfaces,  (b)  Corresponding  bias 
dependence  of  emitted  current  density.  The  full  and  dashed  lines  are  the  current  density  plots  calculated 
with  and  without  the  effect  of  the  dynamic  work  function  shift.  The  following  parameters  were  used  {L\ 
=  300  A,  Lo  =  24.5  A,  W  =  1  cm,  A LaS  =  300  A,  and  A(.43)  =  0.56eV  ).  The  curves  are  labelled  with  the 
values  of  the  transmission  probabilities  7\  and  To  which  were  assumed  to  be  identical. 
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(b) 

Figure  8:  (a)  Bias  dependence  of  the  dynamic  work  function  shift  in  a  typical  while  varying  the  electron 
mean  free  path  XLaS.  (b)  Corresponding  bias  dependence  of  emitted  current  density.  The  following 
parameters  were  used  (£,  =  300  A,  Lo  =  24.6  A,  W  =  1  cm,  A ias  =  300  A,  A(*43)  =  0.56eV,  T\  =  T2  = 
0.5.  and  r  =  0.99). 
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1/v„u.  V  1  I 


(a) 


(b) 

Figure  9:  (a)  Fowler-Nordheim  fit  to  the  bias  dependence  of  the  emission  current  of  a  cold  cathode  with 
the  physical  parameters  listed  in  the  caption  of  Figure  5.  (b)  Fit  of  the  same  versus  calculation 
by  an  expression  of  the  type  Jfh  =  «/oeaV|,“’  for  a  range  of  Vj,a3  for  which  the  emitted  current  density 
varies  between  1  and  1000  Aj cm2. 
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V-107.1X)  R=  0.99S95 


Figure  10:  Fowler-Nordheim  fit  to  the  bias  dependence  of  the  dynamic  work  function  shift  for  a  cold 
cathode  with  the  physical  parameters  listed  in  the  caption  of  Figure  5.  The  following  parameters  were 
used  ( Lx  =  300  A,  L7  =  24.6  A,  W  =  1  cm,  =  300  A,  and  A(^,)  =  0.56eK  ). 


Figure  1=1:  Bias  dependence  of  the  parameter  c  as  a  function  of  applied  bias  for  different  radii  of  a  cold 
cathode  emitter  with  a  circular  geometry.  Cold  cathodes  with  narrower  emitter  window  must  be  operated 
over  a  smaller  dc  bias  range  to  avoid  current  crowding.  The  following  parameters  were  used  (L\  =  300 
A,  L-z  —  24.6  A,  W  =  1cm,  A  us  =  300  A,  &[AS)  =  0.56eV,  and  r  =  0.99).  Current  crowding  effects  are 
negligible  as  long  as  the  parameter  c  is  kept  under  0.05. 
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Figure  12  :  Radial  variation  of  the  potential  drop  across  the  LaS  windows  of  various  radii.  The  cathode  is 
assumed  to  operate  at  room  temperature  and  the  electrical  resistivity  of  LaS  is  set  equal  to  92  The 

potential  is  measured  relative  to  the  potential  at  the  rim  of  the  LaS  circular  window  where  the  electrostatic 
potential  is  assumed  to  be  equal  to  the  applied  bias.  For  each  figure,  the  values  of  the  normalizing  current 
density  are  from  top  to  bottom  1,  10,  100,  1000  A/cnrr.  The  LaS  window  radius  is  indicated  on  top  of. 
each  figure.  The  following  parameters  were  used  (Lx  =  300  A,  L2  =  24.6  A,  W  =  1cm,  A ^  =  300  A, 
and  —  Q.56eV  ).  Right  frames  :  corresponding  radial  dependence  of  the  emitted  current  densities. 

Current  densities  are  normalized  to  the  values  of  the  emitted  current  at  the  window  rim. 
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Figure  13  :  Variation  with  the  parameter  c  of  the  different  contributions  to  the  total  power  dissipated  in 
circular  cold  cathodes  with  different  radii  (see  Eqns. (32-39))-  The  physical  parameters  of  the  cathodes  are 
listed  in  Table  I  and  in  the  caption  of  Fig.5.  The  CdS  layer  is  assumed  to  be  grown  on  a  heavily  doped  InP 
substrate  lOO^m  thick.  The  thermal  conductivity  of  the  InP  substrate  was  taken  equal  to  0.74  W/Kcm 
and  the  parameter  b  in  Eq.(L3  )  was  set  equal  to  1.45.  The  thickness  of  the  LaS  contact  regions  was  set 
equal  to  100  A. 
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V>U.  M 

(a) 


(b) 

Figure  14  :  Power  efficiency  rjp  of  cathodes  of  different  radii  is  plotted  as  a  function  of  (a)  and  (b) 
the  emitted  current  density  For  all  window  sizes,  the  efficiency  decreases  with  and  Jem  as  a 
result  of  current  crowding.  The  efficiency  is  more  or  less  constant  over  a  wider  range  of  for  window 
with  smaller  radius  because  current  crowding  is  less  important  in  that  case,  as  illustrated  in  Fig.  12  .  The 
overall  lower  efficiency  for  smaller  size  window  comes  from  the  fact  that  the  width  b  of  the  thick  LaS 
regions  was  set  equal  to  100/im  for  ail  cathodes.  The  efficiency  of  cathodes  can  be  increased  by  making 
the  ratio  b/a  in  Fig.l  closer  to  unity.  The  following  parameters  were  used  (Lt  =  300  A,  L ?  =  24.6  A,  W 
=  icm,  XLaS  =  300  A,  A(.-ij)  =  0.56eV  ,  7\  =  To  =  0.5,  and  r  =  0.99). 
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Figure  15 :  Temperature  rise  in  the  LaS  thin  film  as  a  function  of  the  applied  bias  for  the  various  cold 
cathode  structures  studied  in  Figure  13.  The  active  area  of  the  cathode  (CdS/LaS  layers)  is  assumed  to 
be  grown  on  a  100/im  thick  InP  substrate  with  the  back  of  the  substrate  acting  as  a  perfect  heat  sink 
(300K). 
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A  NOVEL  COMPATIBILITY/  EQUILIBRIUM  BASED 
ITERATIVE  POST-PROCESSING  APPROACH  FOR 
AXISYMMETRIC  BRITTLE  MATRIX  COMPOSITES 

Reaz  A.  Chaudhuri,  Associate  Professor 
Department  of  Materials  Science  &  Engineering,  University  of  Utah 

Abstract 

A  semi-analytical  iterative  approach  for  enhancing  the  existing  two-dimensional  quasi- 
continuous  axisymmetric  stress  field  for  a  brittle  matrix  micro-composite  (i.  e„  a  single  fiber 
surrounded  by  a  concentric  matrix  cylinder),  is  presented.  The  existing  solution  employs 

Retssner's  variational  theorem  in  conjunction  with  an  equilibrium  stress  field  in  which  the 
radial  (r-)  dependence  is  assumed  a  priori. 

In  the  present  approach,  the  stress  distribution  in  the  radial  direction  obtained  from  the 
afore-cited  variational  model  is  improved  a  posteriori  through  an  iterative  approach  that 
involves  successive  substitution  of  the  previously  computed  strains  (or  stresses)  into  the 
equations  of  compatibility  and  equilibrium.  The  equations  of  compatibility  are  selected  such 
that  they  form  Euler  equations  corresponding  to  appropriate  variational  principle,  such  as  the 
principle  of  minimum  complementary  potential  energy,  etc.  The  boundary/interface  conditions 
at  r  =  constant  and  z  =  constant  surfaces/interfaces  are  satisfied  in  the  pointwise  sense.  The 
expressions  for  the  improved  axisymmetric  displacement  and  stress  fields  are  derived  using  the 
symbolic  language,  MAPLE.  An  illustrative  thermal  stress  problem  is  currently  being  solved, 
and  will  be  used  to  compare  with  the  existing  variational  solution. 
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1.  Introduction: 

Studies  of  the  behavior  of  unidirectional  and  laminated  composites  made  from  stiff 
elastic  matrix  materials  which  may  develop  imperfect  interfaces  with  the  fibers  have  enjoyed  a 
revival  after  the  early  classical  work  of  Aveston,  Cooper,  and  Kelly  (1971).  Pagano  (1991) 
refers  to  these  materials  as  brittle  matrix  composites  (BMC).  The  aforementioned  ACK 
modeling,  as  well  as  the  more  recent  development  proposed  by  Budiansky,  Hutchinson,  and 
Evans  (1986),  are  based  upon  primitive  approximations  of  the  stress  field  developed  within  a 
concentric  cylinder,  i.e.,  a  circular  cylindrical  body  of  one  material  surrounded  by  a  concentric 
annulus  or  ring  of  a  second  material.  There  exists  a  considerable  body  of  literature  associated 
with  the  elasticity  problem  of  a  concentric  cylinder,  where  modem  interest  is  focused  on  its  use 
as  a  representative  volume  element  (RVE)  of  a  unidirectional  composite  (Hashin  and  Rosen, 
1964;  Pagano  and  Tandon,  1988).  Additionally,  a  significant  segment  of  the  composite 
literature  is  based  upon  the  one-dimensional  shear  lag  analysis,  which  was  apparently 
originated  by  Cox  (1952).  The  results  obtained  using  this  kind  of  analysis  are  too  inaccurate  to 
merit  further  attention  in  this  report. 

Sternberg  (1970)  solved  several  axisymmetric  load  diffusion  problems  within  a 
concentric  cylindrical  domain,  in  which  the  elasticity  formulations  are  simplified  by  the 
assumption  that  the  core  material  (fiber)  can  be  modeled  as  a  one-dimensional  bar  while  the 
annular  region  (matrix)  in  all  cases  extends  to  infinity.  An  exact  solution  was  derived  for  the 
case  in  which  the  bar  was  circular  in  cross-section,  and  was  fully  immersed  within  the 
unbounded  matrix.  A  similar  approach,  with  the  difference  of  the  fiber  being  assumed  rigid, 
was  employed  by  Luk  and  Keer  (1979).  This  assumption  is  more  appropriate  for  resin  matrix 
composites,  where  the  fiber  to  matrix  modulus  ratio  is  very  high.  The  axisymmetric  elasticity 
problem  of  a  broken  fiber  embedded  in  an  infinite  matrix  was  treated  by  Pickett  and  Johnson 
(1967).  In  that  work,  the  fiber  is  represented  as  a  three-dimensional  elastic  medium;  however, 
the  report  contained  no  numerical  results  for  the  stress  field.  Smith  and  Spencer  (1970)  also 
formulated  an  axisymmetric  elasticity  solution  by  a  semi-inverse  method  for  a  class  of 
boundary  value  problems  in  which  the  radius  and  length  of  the  concentric  cylinder  are  both 
finite.  The  solution  is  expressed  in  the  form  of  a  series  of  non-orthogonal  functions  which 
satisfy  the  field  equations  exactly.  Homogeneous  boundary  conditions  on  the  radial  surface  of 
the  body  are  satisfied  exactly  while  realistic  end  conditions  can  be  approximated.  The 
singularities  predicted  by  Zak  (1964)  are  smoothed  out  in  this  approach.  A  very  formidable 
study,  which  includes  correlation  with  experimental  observations,  is  that  by  Atkinson  et  al 
(1982).  In  this  work,  the  pullout  of  a  single  fiber  from  a  matrix  cylinder  is  treated.  The  fiber 
extends  only  partially  along  its  length  into  the  matrix.  Results  are  provided  for  a  perfectly 
bonded  fiber-matrix  interface  as  well  as  for  states  in  which  lateral  (curved)  surface  debonding 
or  fiber  end  plane  debonding  take  place.  The  problem  is  solved  by  "patching"  the  asymptotic 
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smgular  stress  field  to  that  given  by  finite  elements,  although  this  method  is  not  always 
reliable.  An  .merest, „g  COIlclusion  is  reached  thati  „  least  in  a  qualitaIive  SMSe  the  interface 

a,  ure  response  can  be  anttc.pated  from  the  stress  field  within  the  uncracked  rod.  In  a  model 
similar  to  Pagano's  (1991),  McCartney  (1990)  treated  a  class  of  concentric  cylinder  problems 
,n  which  matrix  cracking  or  debonding  with  or  without  friction  are  present.  In  that  work,  the 
functional  r-dependence  of  the  stress  components  is  assumed  which  leads  to  a  system  of 
ordtnary  differential  equations  in  z.  All  appropriate  field  equations  are  satisfied  with  the 
except, on  of  two  of  the  constitutive  relations  while  some  of  the  boundtuy/interface  conditions 
ould  only  be  satisfied  in  an  average  sense.  The  ease  and  potential  effectiveness  of  this  model 
or  composite  analysts  will  demand  its  careful  consideration  in  comparison  with  exact  solutions 
and  solutions  given  by  competttive  approaches.  For  example,  Kurtz  and  Pagano  (1991) 
ormulated  an  infinite  senes  solution  of  the  axisymmetric  elasticity  problem  in  which  a  fiber  is 
being  pulled  from  the  matrix.  The  lengti,  of  the  body  as  well  as  the  outer  radius  are  finite 
Although  die  singularity  is  no,  explicit  in  die  solution,  Cestuo  summation  has  been  employed  to 
improve  the  convergence  of  the  stress  field  within  the  singular  region. 

solutjon  for  staled  Pr°Wem  has  been  obtained  by  employing  a  modified  version 
of  the  variational  model  (Pagano,  1991)  of  an  axisymmetric  concentric  cylinder,  which  was 
successfully  implemented  earlier  in  die  case  of  a  flat  laminate  by  Pagano  ( 1 978).  The  model  is 
generated  by  subdividing  the  body  into  regions  consisting  of  a  core  and  a  number  of  shells  of 
constant  thtckness  and  length  and  satisfying  die  Reissner  variational  equation  (1950)  with  an 
assumed  stress  field  in  each  region.  The  number  of  regions,  in  particular  in  the  r  direction,  can 
mcreased  in  order  to  tmprove  solution  accuracy.  The  regions  are  selected  such  tha,  the 
herrno  lastic  properties  am  constant  and  die  boundtuy  conditions  do  no,  change  character  on 
y  O  e  oundtng  surfaces  within  each  region.  Pagano  (1991)  has  sought  to  predict  the 
uence  of  various  kinds  of  damage  (see  Figure  1)  and  their  interactions  by  accurately 

approach. ^  mKr0‘me"  fidd  in  “’-presence  by  using  die  afoie-cited  variation* 

Strengths  and  Weaknesses  of  the  Existing  Approach  fPapann  i  oon 
Strengths: 

equTatioSns;SUmed  ^  aPPr°aCh  'nSUrCS  Sa‘iSfaC,i°"  °f  ‘he  axis*mnKtric  equilibrium 

2Jhe  existing  method  utilizes  a  reasonably  accurate  non-singular  axial  (z-direc,ion)  variation 

he  computed  stress  field  that  also  satisfies  the  end  (z  =  const.)  boundary  conditions  (i  e 
no  artificial  discontinuity  due  to  sectioning).  *  '  *’ 

Weaknesses: 

''  LayermS  in  thC  radial  direction'  ,hal  “boduces  artificial  discontinuities  in  some  stress 
components  at  a  layer  interface  within  the  same  material; 
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2.  As  the  number  of  layer  increases,  the  computed  eigenvalues  become  numerically  very  large, 
thus  limiting  the  number  of  layers  required  for  accurate  stress  field  in  the  vicinity  of  a  stress 
singularity  point. 

Objectives  of  the  Current  Research: 

The  present  method  seeks  to  alleviate  the  afore-mentioned  weaknesses  of  the  existing 
approach,  while  preserving  its  inherent  strengths.  The  specific  goals  of  the  present 
investigation  includes 

-  Improvement  of  the  radial  (r-)  variation  of  the  stresses  so  that  subdivision  into  very  thin 
layers  and  the  associated  blow-up  of  the  computed  eigenvalues  can  be  avoided; 

-  Artificial  discontinuities  of  the  stress  components,  Gq  and  oz,  at  a  layer  interface  within  the 
same  material  can  be  avoided; 

-  Determination  of  the  "stress  intensity  factor"  by  way  of  matching  the  local  asymptotic  stress 
field  with  the  afore-cited  improved  solution  at  an  arbitrarily  close  distance  from  the  point  of 
stress  singularity  (i.  e.,  fiber-matrix  interface  located  at  a  free  edge). 

As  a  first  step,  Chaudhuri  (1996)  presented  an  approximate  "plane  strain"  version  of 
the  present  solution,  wherein  the  equations  were  simplified  by  dropping  1/r  terms  in  the 
equilibrium  equations.  Additionally,  the  constitutive  relations  used  were  those  due  to  the  plane 
strain.  The  primary  reason  for  resorting  to  this  approximation  was  the  fact  that  Chaudhuri's 
(1996)  previously  "improved"  axisymmetric  solution  generated  10  "constants"  of  integration 
for  a  2-layer  fiber-matrix  cylinder  model  against  8  interface/boundary  conditions  in  the  radial 
direction.  Plane  strain  approximation  removed  2  constants  of  integration,  thus  rendering  the 
boundary-value  problem  under  investigation  solvable  in  closed  form.  However,  although  the 
preliminary  results  thus  obtained  demonstrated  that  the  plane  strain  based  approach  can  be 
implemented  in  the  existing  FORTRAN  code  due  to  Pagano  and  co-workers  (1991),  the 
accuracy  of  the  results  were  far  from  encouraging.  This  discrepancy  was  due  to  the  fact  that 
although  the  plane  strain  condition  prevails  in  the  vicinity  of  the  point  of  stress  singularity  (i. 
e„  fiber-matrix  interface  located  at  a  free  edge)  as  was  shown  by  Zak  (1964),  the  same  is  not 
true  in  the  far  field  where  the  boundary  conditions  are  applied.  The  present  (summer,  1997) 
research  has  corrected  this  situation  by  rederiving  the  correct  axisymmetric  solution  with  4 
"constants"  (i.e.,  functions  of  z)  of  integration  per  layer.  This  is  currently  being  implemented 
in  the  framework  of  symbolic  language  MAPLE  software,  for  the  purpose  of  solving  a  system 
of  simulaneous  ODE's  in  terms  of  the  integration  "constants"  (functions  of  z). 

Secondly,  although  the  boundary/interface  conditions  at  r  =  constant  surfaces  were 
satisfied  in  the  pointwise  sense,  the  past  (summer,  1996)  approach  left  the  end  boundary 
conditions  in  the  axial  direction  (i.e.,  at  surfaces  z  =  constant)  undefined  thus  rendering  the 
boundary-value  formulation  ill-posed.  This  ill-posedness  was  removed  by  introducing 
boundary  error  terms  that  helped  satisfy  the  end  boundary  conditions  at  surfaces  z  =  constant  in 
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a  somewhat  ad  hoc  fashion.  The  present  (summer,  1997)  research  has  identified  a  novel 

approach  wherein  the  z-boundary  conditions,  that  can  be  satisfied  in  a  pointwise  sense,  ate 

generated  using  the  equations  of  the  theory  of  elasticity  within  a  particular  cycle  or  iteration. 

owever,  this  discovery  came  too  late  to  be  implemented  in  the  framework  of  MAPLE 

durmg  the  summer  of  1997.  Once  the  afore-mentioned  boundtuy  problem  is  solved  using  the 

MAPLE,  the  solution  will  be  implemented  into  the  existing  ADM  Code  of  Pagano  and  co- 
workers  (1991). 

2.  Solution  Strategy 

Starting  point  of  the  present  research  is  Pagano's  (1991)  layerwise  axisymmetric 

sohihon  f„r  fiber-matrix  concentric  cylinder  model,  based  on  Reissner's  variational  theorem 

(1950)  in  conjunction  with  an  equilibrium  stress  field,  in  which  the  radial  (r-)  dependence  is 

assumed  a  priori.  An  approximate  model  was  formulated  to  define  the  thermoelastic  response 

of  a  concentric  fiber-matrix  cylindrical  body  under  axisymmetric  boundary  conditions  The 

interfaces  between  continguous  cylinders  may  be  either  continuous  or  subjected  to  mixed 

traction  and  displacement  boundary  conditions.  The  external  surfaces  may  be  subjected  to 

mixed  boundary  conditions  that  are  consistent  with  the  model  assumptions  but  otherwise 
arbitrary. 

In  what  follows,  an  improved  stress  field  within  a  layer  is  derived  starting  from 
Pagano's  layerw.se  axisymmetric  fiber-matrix  concentric  cylinder  solution.  The  stress 
distribution  in  the  radial  direction  obtained  from  the  afore-cited  layerwise  fiber-matrix 
concentric  cylinder  model  (Pagano,  1991)  is  improved  n  posteriori  through  an  iterative 
approach  that  involves  successive  substitution  of  the  previously  computed  strains  (or  stresses) 
into  the  equations  of  compatibility  and  equilibrium.  A  similar  procedure  was  implemented  in 
t  e  post-processing  part  of  a  layerwise  finite  element  code  for  analysis  of  quasi-three- 
dimensional  laminated  pla.es/shells  to  obtain  a  mom  accurate  through-thickness  distribution  of 
inter  ammar  shear  stresses  (see  Chaudhuri  and  Seide,  1987;  and  Chaudhuri,  1990)  The 
boundary/interface  conditions  at  r  =  constant  surfaces  are  to  be  satisfied  in  the  pointwise  sense 
hus  eliminating  artificial  discontinuity  in  computed  o9(r,z)and  o2(r,z)  across  a  layer- 
interface  within  the  same  material. 

Pagano's  H99n  MoHpI 

Pagano  (1991)  considered  an  arbitrary  region  within  the  body  defined  by  inner  and 
outer  rad, ,r,  and  r^,  respectively  and  end  planes  z  =  z,.z2as  shown  in  Figure  2  He 
introduced  a  right-handed  cylindrical  coordinate  system  z,  6,  r,  and  employed  a  contracted 
notation  in  the  representation  of  the  stress  and  strain  components,  i.e.. 


CTl  ~  a“’  a2  =  <% ,  O3  =  CTn-,  O5  =  Gn 


(1) 
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and  the  analogous  relation  for  the  engineering  strain  components  £j  (i=l,  2,  3,  5).  The  r,  z 
components  of  displacement  are  designated  as  u,  w,  respectively. 

The  form  of  the  stress  distribution  within  the  annular  region  is  assumed  to  be  given  by 

=  Pijf?  |i  =  1, 2,  3, 5;  J  =  1, 2,  —5  >  (2) 


where  pu  are  functions  of  z  only.  In  order  to  avoid  confusion  with  layer  superscript  k,  the 

index  i  in  fj(>)  is  placed  in  parentheses.  For  a  region  in  which  rj  *  0,  the  functions  fjO)  are 
defined  by 


+  rir2  +i2)r  +  rir2(ri  +r2) 


(ri  *  0) 


f(5)  _  (ri  +  r2)r2  -  (r?  +  r}r2  +  r^)r  j 

3  "  M  +  ' 

with 


PiJ  =  fj  ^  =  0  (tj  *  0;  i  =  1, 2  and  J  =  3, 4, 5  or  i  =  5  and  J  =  4,  5  |  (4) 

In  other  words,  the  functions  fj<>)  and  the  corresponding  pj  not  displayed  in  (3)  all  vanish.  In 
the  event  that  ri  =  0  (fiber  core),  eqn  (3)  is  replaced  by 


4,)»f?)«43)*45) 

43)  =  (r2-^)r  , 


_  x 


(ri  =  0) 

43)=45,  =  (-r2)r 


(5) 


with 

PiJ  =  fj  *  =  o  (n  =  0;  i  =  1,  2  and  J  =  3,  4  or  i  =  5  and  J  =  1, 4  or  J  =  5)  (6) 

It  may  be  noted  that  the  superscripts  k  have  been  omitted  in  eqns  (2)  -  (6)  to  avoid  unnecessary 
congestion.  The  relations  (2)  -  (6)  arise  by  assuming  that  aj  and  a2  are  linear  functions  of  r  in 
the  region  and  then  determining  the  form  of  the  remaining  stress  components  from  the 
equations  of  equilibrium  of  axisymmetric  elasticity  subjected  to  the  following  conditions 


6-7 


Pia  (2)  =  Of  (ra,  z)  (i  =  1,  2, 3,  5;  a  =  1,  2)  (7) 

Thus,  the  p  functions  are  equal  to  actual  stresses  at  r  =  rj ,  r2. 

The  general  form  of  Pagano's  (1991)  solution  for’any  of  the  dependent  variables  P(z)  is 
expressed  by 

P(Z)  =  X  Ai  e»*  +  Pp(z)  (8) 

within  each  constituent  where  A,  are  constants,  X,  are  eigenvalues  of  a  determinant,  and  P„(z) 
is  a  particular  solutton,  which  in  the  present  case  is  a  simple  polynomial.  Further  details  of  the 
solution  procedure  including  the  method  for  determining  the  higher  order  eigenvector  and 
higher  order  particular  solution  are  discussed  by  Brown  (1992), 

Improved  Stress  Field  in  a  Fiber  or  Matriv  I  ay*r 

An  examination  of  Pagano’s  (1991)  axisymmetric  fiber-matrix  concentric  cylinder 
model  reveals  that  he  has  assumed  a  linear  variation  of  axial  stress,  0i(r,z)  and  hoop  stress 
e(r,z),  with  respect  to  r  within  a  fiber  or  matrix  layer.  This  is  consistent  with  the  assumption 
of  Love-Kirchhoffs  thin  shell  theoiy.  However,  he  derived  the  interlaminar  shear  stress 
t„(r,z),  and  the  radial  stress.  G,(r,z),  using  the  equations  of  equilibrium  in  line  with  his 
earlier  work  (see  Pagano,  1969)  before  substituting  these  stresses  into  Reissner’s  (1950) 
variational  principle.  In  what  follows,  improved  deformation  and  stress  fields  based  on 

successive  use  of  compatibility  and  equilibrium  equations  of  axisymmetric  elasticity  theoty  is 
derived  m  the  annular  and  core  regions,  respectively. 

Annular  Laver: 

Pagano’s  stress  field  for  an  annular  (fiber  or  matrix)  layer  k  =  1, 2, ...  is  as  shown  below: 

5jk)  =  p!{)(z{-^]+p«)(z/-!^iL) 

fo  -r, 


+  p8WJ_!l 

V  r2  ~  rl 


^Vr,z)=  p'«(z)i^  +  p?2.(2)Jlr^+  k  /  (q_+  r2)r2  (r,2  +  r,r?  +  r2  )r  1 


rl(r2  ~rl) 


r2(r2  -  rj) 


a!k)(r,z).=  p^)(2)  J2__L  +  p(k)(z)[ Izil  Loo,  j  3  (  2  ,  x 

\r2-rj  P32  ^r2_ri  J+P33  (z)(r  (rj  +r,r2 +r22)r  +  r,r2(r, +r2)) 

+Pm  (Z)(r2  -  (r,  +  r2)r  +  r,r2)  +  p'k5>(z)(r2  -fr.  +  r2)r  +  r,r2)-L  (9c) 

It  is  worthwhile  to  point  out  here  that  although  the  b  Jed  stresses  satisfy  the  equations 
of  equilibrium  for  an  axisymmetric  elastic  body,  the  corresponding  strains  fail  to  satisfy  the 
q  ns  of  compatibility  in  the  pointwise  sense.  It  is  noteworthy  that  although  both  the  plane 
strain  and  axisymmetric  deformations  represent  two-dimensional  stales,  the  latter  case  requires 
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four  compatibility  equations  to  be  satisfied  by  the  strains  computed  using  Pagano’s  (1991) 
axisymmetric  variational  model.  This  is  in  contrast  to  the  case  of  plane  strain,  wherein  only  one 
compatibility  equation  out  of  the  six  is  not  an  identity.  However,  if  the  hoop  stress,  O0,  is 

derived  using  the  exact  axisymmetric  elasticity  based  kinematic  relations  and  Hooke's  law,  then 
three  of  the  four  compatibility  equations  required  by  the  axisymmetric  elasticity  theory  become 
identities.  The  combined  kinematic  and  stress-strain  relations  for  axisymmetric  elasticity  theory 
are  given  as  follows  (Timoshenko  and  Goodier,  1959): 


u(r,z)  =  re6(r,z)  =  — [6e(r,z)  -  vk  (ar(r,z)  +  o2(r,z)}J 

fck 

3u(r,z)  _  1  _ 

~ 3r  =  £r(riz)  =  — [ar(r,z)  -  vk  {c0(r,z)  +  a2(r,z)}] 


(10a) 


(10b) 


The  tilda  quantities  represent  the  "improved"  stresses,  strains  and  displacements,  obtained 
using  the  equations  of  compatibility,  equilibrium,  etc.,  but  may  not  satisfy  the  prescribed 
z=constant  boundary  conditions.  Eliminating  u(k)(r,z)  from  eqns  (10)  and  solving  for 
Oek)(r,z)will  yield 

^ek)(r,z)  =  r-(1+Vk)  j|(l  +  vk )rVk ar(r,z)  +  vkr(1+Vk) +  vkr(1+Vk)  ^^^}dr  +  F,(k)(z) 
.1  or  5r  J  1 

00  .  (11) 
where  F,  (z),  k  -  1,  2,...  is  a  "constant"  of  integration  (with  respect  to  r)  and  an  arbitrary 

function  of  z.  The  hoop  stress  in  an  annular  layer  (k  *  0,  may  be  fiber  or  matrix)  is  obtained  by 
substitution  of  a£k)(r,z)  and  c[k)(r,z)  from  eqns  (9a, c)  into  the  right  side  of  eqns  (1 1) 

d^k|(r,z)  =  ^k)*(r,z)  +  -0—  F<k,(z);  k=  1,2,3,...  (i2a) 


where 


o<k»*(r,z)  =  (l  +  vk)[p(Vw-0— (-2- - — )  +  ptk)(z) — ! — ( — i - 0_) 

(r2-r,)  vk+l  vk  +2  32  (r2 -r,)  vk  +2  vk+lJ 

+  p“<z)l^  - (r'  +  r'r> +  r’ ^  + 


~(rl  +r2)v^t2+7TT,  +  P«  <z)~ f~ ^-(rl  +r2)— !— +— r,r2l] 
vk+2  vk+l  r,r2  vk  +  2  vk  +  l  rvk  1 2 


1  1 


+  Vkt  PsA2)"  ~  — +  P3k>(z) - - - fp3j)(z){ - - - 

(r2  ~ri)(vk  +2)  (r2  —  r,  )(vk  +2)  P33^(vk+4) 


■(r‘+r'rz+rz)^1+p«(z^-<z'+^^) 
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+  P35>(Z){ 


r,r2(vk  +  2)  rvk 


~~  }  Pn’Cz) 


(r2~rI)(vk+2) 


+  Pn)(z) 


(r2-r,)(vk+2) 


] 


(12b) 


The  radial  displacement  component,  u(k)(r,z).  in  the  kth  layer  can  now  be  obtained  by 
substituting  eqns  (12)  into  eqn  (10a)  as  follows: 


u(k)(r,z)  =  u(k)*(r,z)  +  -i— ^E(k)(z);k  =  1,2,. 


~vk 


where 


(13a) 


u(k)*(r,z)  = 


«(*) 


-vkK 


+  4k))}; 


k=  1,2, 


(13b) 


whence  ejk)(r,z)  can  be  obtained  by  direct  differentiation  with  respect  to  r.  Interlaminar  shear 
strain,  Y^riz),  can  now  be  obtained  using  Hooke’s  law: 


f^r,z)  =  ^(r,Z)  (U 

It  may  be  noted  that  the  above  two  strains,  e(rk)(r,z)(or  u)  and  Y(rck)(r,z),  are  not  compatible 
with  the  axial  strain, £< k  >  (r,  z) ,  computed  using  Hooke’s  law 


^»  =  J-[g(k).vk(6^  +  5(«)] 

because  the  remaining  compatibility  equation 
32ez  [  _  d2yn 

dr2  rl72  3r9z 

or  the  ensuing  identity 

d  £z  j  _  d 
dr  dz2  dz 


(15) 


(16a) 


(16b) 


is  not  satisfied.  It  may  be  noted  here  that  since  the  radial  displacement  u<k)(r,z)  has  already 
been  derived  using  the  appropriate  kinematic  and  constitutive  relations  of  axisymmetric 
elasticity  theory,  given  by  eqns  (10),  the  radial  strain,  e;k’(r,z),  will  automatically  be 
compatible  with  the  other  two  strains  of  eqn  (16a),  if  u(k>(r,z)  is  compatible.  Hence,  the 
appropriate  compatibility  equation  to  be  satisfied  is  eqn  (16b)  instead  of  eqn  (16a).  It  is 
noteworthy  that  the  compatibility  equation  (16a)  is  an  Euler  equation  of  the  principle  of 
minimum  complementary  energy,  while  eqn  (16b)  corresponds  to  an  Euler  equation  of  a  yet  to 
be  derived  variational  principle  intermediate  between  the  principle  of  minimum  potential  energy 
and  Reissner's  variational  principle. 
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The  axial  strain,  £*k)(r,z),  is  therefore,  obtained  by  substituting  u(k)(r,z)  and 
Yrz  (r»z)  given  by  eqns  (13)  and  (14)  into  the  integrated  (with  respect  to  r)  version  of  the 
compatibility  equation  (16b)  as  follows: 


(17b) 


(18b) 


E®(r,z)  =  E<k>*(r,z)+i— -Ffk>(z)"+I^k5(z);  k»  1,  2,...  n7a) 

where 

f  d2u^kl*(r,z)a  fdy^r.z) 

ez  (r,z)  — j  — - dr  +  J - - - dr;  k  =  0,  1,  2,...  (l7b) 

where  u(k)  (r,z)  and  y^}(r,z)  are  given  by  eqns  (13b)  and  (14),  respectively.  The 
corresponding  normal  stress,  d(zk)(r,z),  can  be  obtained  by  using  Hooke’s  law  as  follows: 

d'k)(r,z)  =  CT(k>*(r,z)  +  r1-v‘F«(zr+EkF<k>(z)  +  -i^7F<k)(z);k=  1,2,...  (18a) 

where 

clk)*(r,z)  =  Eke*k)*(r,z)  +  vk {or(k)(r,z)  +  CTgk)*(r,z)}=  0;  k  =  1,  2,  ...  (18b) 

It  is  noteworthy  that  although  the  axial  strain,  e<k)(r,z),  or  its  stress  counterpart,  CT^k)(r,z), 

satisfies  the  compatibility  eqn  (16),  it  is  no  longer  in  equilibrium  (in  the  pointwise  sense)  with 
the  stresses,  T^}(r,z)and  cr(k)(r,z).  These  stresses  are,  therefore,  rederived  from  the 
following  two  equilibrium  equations: 

*hL  +  ln-+tejL  =  0 

dr  r  dz  09a) 

a<yr  ,  <Tr-<?8  ,  n 

dr  r  dz  (19b) 

xre  (r,z)  and  a[k)(r,z)  can  now  be  obtained  from  eqns  (19)  upon  integration  as  follows: 

z-(r.z,  =  I[-Jra^dr  +  F«(z)]  (20a) 

*!k’(r.Z)  =  ’dr  -  Jr^^dr  +  F<‘>(z)]  (20b) 

Substitution  of  dz  (r,z) ,  given  by  eqn  (18)  into  the  integrated  equilibrium  equation  (20a) 
yields 

^,(^)  =  e(r.z)-gF«(zr-^^>(z,--EkIp(l.(zr  +  ^  (21a) 


axF7,  +  xrz  +  daz 

_  A 

dr  r  dz 

—  U 

a<Tr  ,  ar  ~  ae  , 

3r  r  + 

dz 
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where 


>-(r,z)-— 

r  J  dz 


(21b) 


Subs,Kut,0„  of  df  >(r,z)and  ^(r.z)  given  by  eqns  (12)  and  (21),  respectively,  i„to  the 
second  integrated  equilibrium  equation  (20b)  yields 

^>(r,2)  =  <,?'('.*)  +  — f'2  -^>(^-+-3^  ^ 

(3-vk)(4-vk)  (l-vk)(2-vk)  1  (Z) 


where 


Jra^tL2drj 

r  J  J  dz  J  (22b) 

“:“r  "r boundary  ^  ^  <*«*»  <-■ « * 

I  ,  ,ant):„a  ,U"d  °f  mathematical  Alzheimer’s,  thus  rendering  the  boundary-value 

be^urfold05?'  ThiS  "  T  “  faC'  ‘hat  "*  ^  problem  has  now 

been  transformed  mto  an  tmual  value  problem,  which  is  analogous  to  Hadamard’s  treatment  of 

auchy  problem  (see  Ttkhonov  and  Arsenin,  1979).  This  Ul-posedness  will  be  removed  by 

envmg  appropnate  boundary  constraint  tenns,  which  are  functions  of  r,  that  help  satisfy  the 
end  boundary  conditions  at  surfaces  z  =  constant  as  follows:  ^ 

e<k,(r.z)  =  w«(r,z)  =  2L{a<»  -  vk(a<k>  +  a<k>))  (23) 

whence 

W<t,(r’ z)  =  ^  J  <d®  -  vk  <«Jk)  +  <^k)))dz  +  H<k>(r)  (24) 

Substitution  of  o2  (r,z),  o<k)(r,z)  and  d^k)(r,z),  given  by  eqns  (18),  (22)  and  (12) 
respectively  into  eqn.  (24)  gives 

w'k>(r,z)  =  Je<k>*(  r,z)dz  +  ±j jr'^F<k>(zr  +  EkF<k>(z)  +  ^(k^L 

V,  ,  r^-Vu  ,  r  k  J 

- Mf - - - - p(k)p,v,„  ,  vkr  Vk  ^  r-1_vk 

Eu  J ln-v.  v4_,mFi  (z)  +  t, - — - F,(k)rzr-£ _ poo,^ 


(22b) 


-- - - - k  p(k vkrl  V|t  n,\  r-1_vk 

EkJ'(3-vk)(4-vk)F'  (Z)  +^J^Fl^>"-~Fr(z) 
+Ek  jF<k>(zr  -  W+m  +  -^'(^dz  +  H(k,(r) 


(25a) 


6-12 


where 


e(k)*(r,z)  =  -^-[c(k)*(r,z)- vk{a(/°\r,z)  +  a(ek)*(r,z)}] 


(25b) 


Next,  using  the  relationship 

u(k)(r,z)  =  — x(k)(r,z)- w{k)(r,z)  (26) 

iZ  Gk 

whence  u^(r,z)  can  be  obtained  as  follows: 

u(k)(r,z)  =  —  f  [^(r.z)  -  w(k)(r,z)]dz  +  H^k)(r)  (27) 

Gk 

The  stress,  a[k)(r,z),  can  now  be  obtained  using  Hooke’s  law 

c(rk)(r,z)  =  Eke(rk)(r,z)  + vk[a(zk)(r,z)  +  d^k)(r,z)]  (28) 

where 

4k)(r,z)  =  u(k)(r,z)  (29) 

Finally,  the  stresses  xr2(r,z)  and  az(r,z)can  be  derived  using  the  equations  of  equilibrium 
(19)  as  follows: 

xrz(r,z)  =  ~  f [-|-{ror(r,z)}  +  ae(r,z)]dz  +  H(3k)(r)  (30) 

r J  dr 

cz(r,z)  =  -i  j  [|-{rx„(r,z)]dz  +  H<,k)(r)  (31) 

rJ  dr 

These  expressions  are  derived  using  the  symbolic  language,  MAPLE. 

Core  Region: 

Pagano's  (1991)  stress  field  for  the  core  layer  (core,  k  =  0,  always  fiber)  is  as  shown  below: 


5'°»(r,2)  =  p(»>(z)^  +  pS?(z)- 

r2  r2 

x^}(r,z)  =  p(52)(z)—  +  p(5°3)(z)(r-r2)r 
r2 

a[0)(r,z)  =  p{3(J)(z)(^^)  +  p(32)(z)— +  pS)(z)(r2-r22)r  +  p(3J)(z)(r-r2)r 
r2  r2 

The  hoop  stress,  a(0O)(r,z),  is  derived  using  the  same  procedure  as  above: 


(32a) 


(32b) 


(32c) 
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40)(r,z)  =  r(1+v»)  f 1(1  +  v0)rv»5I(r,z)  +  vkr(l"'v°)— ^r,z)  +  v0r(l'|-v°)  ^'^Idr  +  f£0)(z) 

(33) 

where  Fq°^(z),  is  a  "constant"  of  integration  (with  respect  to  r)  and  an  arbitrary  function  of  z. 

The  hoop  stress  in  the  core  is  obtained  by  substitution  of  a£0)(r,z)  and  a[0)(r,z)  from  eqns 
(32a,c)  into  the  right  side  of  eqn  (33) 

a^0)(r,z)  =  o^)*(r,z)  +  -I^-F^0)(z) 


(34a) 


where 


r(0)*/'r  rr')  —  n  M^O)/  ^ 


W  (r,z)  =  (1  +  v0)[P3i  (z)( 


Vq  +  1  r2(v0  +  2) 


)  +  p^(z) 


r2(vo  +  2) 


+  P(33}(ZX 


_2 

n2 


v0+4  v0  +  2 


) 


+P(34)(ZX  r 


r>r 


)]  +  Vo[-p^}(z) 


vo+3  vo  +  2 
2 


(v0  +  2)r2 


+  p(3^(z) 


-5Lr) + -  pfffz)  + pK'(z) 


r2(v0  +  2) 
(0) 


+  P33}(z)(  3r 


v0  +  2 


vo  +  3  v0  +  2 


r2(v0  +  2) 


r2(v0  +  2) 


] 


v0  +  4 
(34b) 


An  examination  of  eqns  (34)  reveals  that  a^(r,z)  becomes  unbounded  at  the  core  centerline,  r 
=  0.  Enforcement  of  the  boundedness  of  6^0)(0,z)  reduces  F,$0)(z)  to  0.  However,  it  may  be 
noted  that  unlike  in  the  case  of  an  annular  region  discussed  earlier,  the  radial  displacement 
component,  u(0)(r,z),  will  not  be  obtained  using  the  constitutive  relation,  given  by  eqn  (10a), 
because  this  will  result  in  an  over-determined  system  by  assigning  one  extra  condition  at  r  = 
constant  boundary/interface. 

Interlaminar  shear  strain,  y'^  (r,z)  and  the  radial  normal  strain,  er(0)(r,z),  can  now  be 
obtained  by  substitution  of  the  above  stresses  into  Hooke's  law: 


YreXriz)  =  — 

£r0>(r,z)  =  -~[ct*0>  -  v0(Oe0>  +  o‘0>)] 

En 


(35a) 

(35b) 


It  may  be  noted  that  the  above  two  strains,  E(rk)(r,z)and  y^}(r,z),  are  not  compatible  with  the 
axial  strain,  {Xk)(r,z),  computed  using  Hooke's  law 


l[0)(T,Z)  = 


^[g'0) 


■  v0(aek)  +  a|0,)j 


(35c) 
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because  the  remaining  compatibility  equation,  given  by  eqn  (16a)  is  not  satisfied.  It  may  be 
remarked  that  in  the  absence  of  an  exact  solution  to  axisymmetric  elasticity  boundary-value 
problem,  the  appropriate  variational  principle  will,  as  mentioned  earlier,  demand  a  different 
compatibility  equation,  such  as  eqn  (16a)  or  (16b). 

The  axial  strain  is,  therefore,  obtained  by  substituting  yJ^Cr.z)  and  e[o)(r,z)  given  by 

eqns  (35  a,b)  into  the  compatibility  equation  (16a)  as  follows: 


2=(0) 


an 


E0  dz2  r 


v0(^e0)  +  az0))}  + 


i  azx: 


2^(0) 


rz 


drdz 


(36) 


Further  substitution  of  eqns  32(a,c)  and  (34)  into  eqn  (36)  and  integration  with  respect  to  r 
twice  lead  to 


e(z0)(r,z)  =  e;(r,z)  +  rF<°WFf  }(z)  (37) 

The  corresponding  stress,  az°^(r,z),  from 

d(z0)(r,z)  =  E0e(z0)(r,z)  + vo{aJ0)(r,z)  +  a^0)(r,z)}  (38) 


can  be  expressed  in  the  form: 

a<0)(r,z>  =  <40)‘(r,z)  +  E0(rF,(0)  +  ^0))  (39a) 

with 

oz0)*(r,z)  =  E0e(z0)*(r,z)  +  v0{o[0)(r,z)  +  6^0)(r,z)}  (39b) 

It  may  be  noted  that  the  stresses  o(e0)(r,z)  and  6(z0)(r,z),  although  compatible,  are  no  longer  in 
equilibrium  with  the  remaining  stresses,  T^}(r,z)  and  o[0)(r,z),  given  by  eqns  (32b,c). 
These  stresses  are  obtained  using  the  equilibrium  equations  of  axisymmetric  elasticity  in  a 
manner  similar  to  an  annular  region,  mentioned  earlier: 


2  /  '  p(0) 

T(»(r,z)  =  Tg’VzJ-Eot^-Ff0)  (z)  +  ^F<°>  (z))  +  S_ 


rz 
where 
T(0) 


rz 


*(r,z)  =  --[J 


1f  ao(70)*(r,z) 


dz 


dr] 


and 


a(r°>(r,z)  =  40)*(r,z)+Eol^0>  (z)  +  ^F<0>  l_F30)  +  4~ 


where 


(40a) 


(40b) 


(41a) 
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(41b) 


d(0)*(r,z)  =  -[ f c(e0)*(r,z)dr -  f r^1*-2  ^^dr] 
r  J  J  dz 


Finally,  the  displacement  component,  w(0)(r,z)  is  obtained  by  utilizing  the  following 
constitutive  relations: 

■M(r  '  1 


e(20)(r,z)  =  w(2  )(r,z)  =  —  {d(z0)  -  v0(d(r0)  +a^0))) 

E0 


(42) 


The  new  ez0)(r,z)is  now  given  by 


-3  m  2  //  _  /  'c(O) 

ez0)(r,z)  =  e(z0)*(r,z)  +  rF1(0)(z)  +  F2(0)(z)- v0{— F,<0)  (z)  +  — F'0)  }  +  ^-{F3(0) 

12  6  En  r 


(43a) 

ezur  (r,z)  =  “Kur(r,z)-v0{G(r°r(r,z)  +  CT(e0)*(r,z)}]  (43b) 

w^(r,z)  can  now  be  obtained  by  integrating  e^0)(r,z)  as  folows: 


with 

;(0)*,r  J_rr(0)* 

E0 


w(0)(r,z)  =  w(0)*(r,z)  +  rj F|0)(z)dz  +  [ F^0)(z)dz  -  v0{— F[0)  (z)  +  — F^0)  } 

12 


—  {Fj0)(z)  -  -  f  F^0)(z)dz}  +  H^0)(r) 

0  rJ 


6 


E 

with 


(44a) 


w(0)*(r,z)  =  |e20)*(r,z)dz 


(44b) 


The  remaining  steps  for  derivation  of  u(0)(r,z),  the  stresses  Trz(r,z)  and  Gz(r,z) are  identical 
to  those  for  an  annular  region,  and  will  not  be  repeated  here.  These  expressions  are  derived 
using  the  symbolic  language,  MAPLE.  Finally,  the  unknown  functions  of  z  and  r  are  currently 
being  determined  using  MAPLE  from  boundary/interface  conditions  in  the  r-  and  z-directions, 
respectively. 

This  ends  the  first  iteration  or  cycle.  The  stresses  G^k)(r,z)  and  a(rk)(r,z)  computed 
above  will  now  be  substituted  back  into  eqn  (12)  to  compute  the  hoop  stress  G^k)(r,z)  in  the 
second  cycle,  and  the  process  can  be  continued  for  either  a  prescribed  number  of  cycles,  n,  or 
until  convergence  is  reached  within  certain  pre-determined  tolerance. 

The  above  procedure  introduces  4M  "constants"  of  integration,  Fj(k)(z),  i  =  4;  k  = 

0,  1,  2,...,  M-l  —  4  per  iteration —  which  are  functions  of  z,  for  each  iteration.  These  are 
determined  by  using  4M  appropriate  boundary/interface  conditions  per  iteration  including  those 
at  the  axis  of  symmetry. 
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Interfacing  or  Patching  of  Local  Asymptotic  Singular  and  Global  AwsvmroettlC 
Micromechanical  Stress  Fields 

Zak  (1964)  was  the  first  to  demonstrate  interfacing  a  finite  difference  based  global 
axisymmetric  nonsingular  stress  field  with  the  singular  stress  field  obtained  using  a  two- 
dimensional  (plane  strain)  asymptotic  solution  due  to  Williams  (1952).  This  approach  was 
illustrated  by  Zak  (1964)  by  matching  the  solution  of  his  asymptotic  analysis  with  the 
numerical  finite  difference  analysis  for  the  free-clamped  boundary  condition.  Zak  (1964) 
arbitrarily  chose  the  point  of  matching  at  a  distance  of  2%  of  the  thickness  of  the  annular 
region  for  one  of  the  three  stresses,  Cz,  cr  and  T^,  which  would  yield  the  stress  intensity 
factor.  The  remaining  two  stresses  computed  using  the  asymptotic  analysis,  when  multiplied 
with  this  factor,  would  match  very  closely  their  global  axisymmetric  counterparts  computed 
using  the  finite  difference  analysis.  Since  the  stress,  Oq,  cannot  be  derived  using  the  plane 
strain  asymptotic  analysis,  this  stress  was  left  out.  Zak  (1964)  further  remarked  that  if  the  finite 
difference  based  numerical  results  were  available  closer  to  the  point  of  stress  singularity, 
further  improvement  could  be  expected. 

Pochiraju  (1993)  in  continuing  Zak's  (1964)  research  computed  the  axisymmetric  stress 
field  using  a  highly  refined  finite  element  mesh  near  the  point  of  stress  singularity  with  the  help 
of  a  commercially  available  package,  ABACUS.  He  used  the  standard  8-noded  quadrilateral 
elements  with  no  special  singularity  formulation.  Like  Zak  (1964),  Pochiraju  (1993  matched 
one  stress  component  computed  using  the  FEM  (finite  elements  method)  at  a  point  on  the  fiber 
matrix  interface  at  a  very  close  distance  of  the  order  of  lO"4  -  10  7  times  the  fiber  radius  with 
that  obtained  from  the  asymptotic  analysis.  The  scaling  or  stress  intensity  factor  thus  computed 
when  multiplied  with  the  other  stress  components  from  the  asymptotic  analysis  or  with  the 
same  stress  component  at  other  angles  led  to  matching  of  all  the  stress  components  from  the 
two  analyses  at  all  angles  at  the  same  radius.  Pochiraju  (1993)  also  established  the  region  of 
dominance  for  each  term  of  the  asymptotic  analysis  with  an  error  function. 

Although  Zak's  (1964)  and  Pochiraju’s  (1993)  approach  is  effective  in  the  context  of 
two-dimensional  finite  difference  and  FEM  models,  such  a  brute  force  model  cannot  be  used  in 
situations  such  as  the  present  variational  axisymmetric  model  because  of  the  afore-mentioned 
eigenvalue  blow-up  or  in  any  three-dimensional  modeling  environment,  where  the  cost  of  this 
degree  of  refinement  will  be  prohibitive.  The  proposed  research  represents  a  novel  attempt  to 
recover  a  nearly  "exact"  elasticity  solution  by  successively  satisfying  the  equations  of  elasticity 
in  the  pointwise  sense,  as  opposed  to  mean-square  sense  of  the  finite  difference  or  FEM.  The 
improved  stress  field  can  easily  be  matched  with  its  asymptotic  analysis  counterpart  without 
having  to  resort  to  highly  refined  layering  approach  of  the  existing  numerical  techniques. 
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3.  Example  Problem  —  Expected  Results 

As  an  illustration  of  the  present  approach  and  to  examine  the  fidelity  of  its  predictive 
capability,  the  body  depicted  in  Figure  2  will  be  considered,  and  the  improved  results 
computed  using  the  present  compatibility/equilibrium  based  approach  will  be  compared  to  the 
corresponding  variational  model  solution  due  to  Pagano  (1991).  All  the  external  boundaries  are 
assumed  to  be  traction-free  and  that  the  body  is  subjected  to  a  1°C  temperature  rise.  As  a  first 
step,  the  fiber  is  represented  as  a  single  solid  cylinder  or  the  core,  while  the  matrix  is 
represented  as  a  single  annular  ring  (Pagano's  N=l).  Both  materials  are  assumed  to  be 
isotropic  with  the  following  properties 

Ef  =  413GPa  vf  =  0.2  af=  3.25  p/C 

^  =  63GPa  vm  =  0.2  ctm  =  3.50p/G 

where  the  subscripts  f  and  m  stand  for  fiber  and  matrix,  respectively,  and  the  geometric 
parameters  are  taken  as 


The  computed  results  (M  =  2)  will  be  compared  with  Pagano’s  (1991)  multi-ring  (e.  g.,  M  = 
10,  20,  etc.)  solution  in  order  to  assess  the  degree  of  improvement  of  the  computed  stress 
field.  Next,  the  effects  of  layering  and  number  of  iterations  (n  >1)  on  the  convergence  of  the 
present  solution  will  be  assessed  by  way  of  comparison  with  the  exact  nonsingular  series 
solution  due  to  Kurtz  and  Pagano  (1991).  Finally,  the  improved  "converged"  stress  field  will 
be  matched  with  the  asymptotic  singular  stress  field  at  a  close  distance  of  the  order  of  10"*- 1 0'7 
times  the  fiber  radius. 

4.  Closure 

A  semi-analytical  iterative  approach  for  enhancing  the  existing  two-dimensional  quasi- 
continuous  axisymmetric  stress  field  for  a  brittle  matrix  micro-composite  (i.  e„  a  single  fiber 
surrounded  by  a  concentric  matrix  cylinder),  is  proposed.  In  the  present  approach,  the  stress 
distribution  in  the  radial  direction  obtained  from  the  afore-cited  variational  model  due  to  Pagano 
(1991)  is  improved  a  posteriori  through  an  iterative  approach  that  involves  successive 
substitution  of  the  previously  computed  strains  (or  stresses)  into  the  equations  of  compatibility 
and  equilibrium.  A  semi-analytical  iterative  approach  for  enhancing  the  existing  two- 
dimensional  quasi-continuous  axisymmetric  stress  field  for  a  brittle  matrix  micro-composite  (i. 
e.,  a  single  fiber  surrounded  by  a  concentric  matrix  cylinder),  is  presented.  The  existing 
solution  employs  Reissner  s  variational  theorem  in  conjunction  with  an  equilibrium  stress  field 
in  which  the  radial  (r-)  dependence  is  assumed  a  priori. 

In  the  present  approach,  the  stress  distribution  in  the  radial  direction  obtained  from  the 
afore-cited  variational  model  is  improved  a  posteriori  through  an  iterative  approach  that 
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involves  successive  substitution  of  the  previously  computed  strains  (or  stresses)  into  the 
equations  of  compatibility  and  equilibrium.  The  equations  of  compatibility  are  selected  such 
that  they  form  Euler  equations  corresponding  to  appropriate  variational  principle,  such  as  the 
principle  of  minimum  complementary  potential  energy,  etc.  The  boundary/interface  conditions 
at  r  =  constant  and  z  =  constant  surfaces/interfaces  are  satisfied  in  the  pointwise  sense.  The 
expressions  for  the  improved  axisymmetric  displacement  and  stress  fields  are  derived  using  the 
symbolic  language,  MAPLE. 

An  illustrative  thermal  stress  problem  is  currently  being  solved,  and  will  be  used  to 
compare  with  the  existing  variational  solution.  When  completed,  this  research  will  represent  a 
novel  semi-analytical  post-processing  tool  to  improve  solution  accuracy  and  numerical 
efficiency  of  existing  variational  solutions  for  micro-composites.  The  final  results  will  be 
reported  in  Chaudhuri  et  al.  (to  be  published). 
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Figure  2.  Cross  Section  of  Domain  for  ADM  Analysis  (Pagano,  1991) 
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MECHANICAL  PROPERTIES  AND  TEXTURE  FORMATION 
DURING  THE  COMPRESSION  OF  TANTALUM 


Robert  J.  De  Angelis 
Professor 

Department  of  Mechanical  Engineering 
University  of  Nebraska-Lincoln 


Abstract 


The  production  of  tantalum  plate  with  controlled  degrees  of  anisotropy  is  essential  to  obtain 
reliable  performance  of  explosively  formed  projectiles  (EFP).  In  this  investigation  the  mechanical 
response  of  tantalum  was  determined  in  compression  for  materials  produced  employing  two 
methods:  hot  isostatic  compression  of  tantalum  powder;  and  cold  forging  and  rolling  of  a  cast 
ingot.  The  fiber  texture  formation  during  compression  testing  was  also  monitored  employing  x-ray 
diffraction  techniques.  The  densities  of  <100>,  <11 0>  and  <1 1 1>  directions  were  determined  as  a 
function  of  compressive  strain  for  the  two  types  of  materials.  In  the  hot  isostatic  compacted 
tantalum  power  specimens  the  densities  of  <1 1 1>  and  <100>  poles  were  observed  to  increase 
very  rapidly  during  the  initial  0.7  logarithmic  compression  strain.  Compression  of  specimens  cut 
in  the  rolling  and  transverse  directions  of  the  wrought  plate  showed  approximately  linear  increases 
of  the  densities  of  <1 1 1>  and  <100>  poles  with  compressive  strain.  The  rate  of  increase  of  the 
<1 1 1>  is  about  twice  the  rate  of  increase  of  the  <100> 

The  mechanical  response  of  the  specimens  in  compression  were  determined.  The  yield 
strength  of  the  hot  isostatic  pressed  tantalum  was  significantly  greater  than  the  wrought  plate 
material.  Both  materials  demonstrated  yield  points  at  the  initiation  of  plastic  deformation.  The 
flow  stresses  in  the  powder  processed  tantalum  were  four  percent  greater  in  specimens  with  axes 
of  compression  in  the  radial  direction  than  those  with  axes  of  compression  in  the  thru  thickness 
direction.  In  the  wrought  plate  material  the  specimens  with  compression  axes  parallel  to  the 
rolling  direction  had  2.5%  lower  flow  strength  than  specimens  with  compression  axes  parallel  to 
the  transverse  direction. 
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MECHANICAL  PROPERTIES  AND  TEXTURE  FORMATION 
DURING  THE  COMPRESSION  OF  TANTALUM 


Robert  J.  De  Angelis 


Introduction: 

Processing  parameters  have  a  important  influence  in  controlling  the  defect  structure  in 
metals.  In  the  case  of  pure  metals  the  grain  size,  grain  shape,  grain  orientation  and  total  defect 
concentration  are  controlled  by  processing.  Unquestionably,  processing  consisting  of  plastic 
deformation  by  compression  produces  changes  in  the  texture  of  a  polycrystalline  material.  This 
change  in  texture  is  accompanied  by  a  corresponding  change  in  engineering  properties  and  is  the 
prime  cause  of  anisotropy  in  polycrystalline  metallic  materials.  Texture  or.  preferred  orientation 
imparts  the  anisotropic  properties  of  the  single  crystal  to  the  polycrystalline  aggregate.  In 
applications  which  demand  that  the  material  demonstrate  extensive  plasticity,  anisotropic  behavior 
due  to  the  existence  of  a  texture  is  normally  undesirable.  This  is  a  important  consideration  in  the 
processing  of  face  centered  cubic  metals,  such  as  aluminum  and  copper.  However,  it  is  a  critical 
concern  in  processing  body  centered  cubic  metals,  especially  tantalum  and  niobium.  To  control 
the  anisotropic  properties  developed  in  tantalum  due  to  preferred  orientation  the  effects  of  plastic 
deformation  on  texture  formation  must  be  precisely  determined.  To  aid  in  accomplishment  of  this 
task  requires  the  availability  of  a  reliable  and  simple  methods  to  characterize  the  texture  existing 
at  every  stage  of  processing. 

Despite  the  general  recognition  that  detailed  texture  description  is  needed  to  control 
macroscopic  properties  very  few  investigations  of  materials  processing  have  included  the 
quantification  of  texture.  Since  the  1980's  the  description  of  material  textures  or  crystal 
orientations  in  polycrystalline  wires  and  sheets  started  to  move  beyond  the  crystallographic  pole 
figure  representation  of  texture.  The  orientation  distribution  function  (ODF)  has  become  the 
method  of  choice  for  presenting  the  description  of  material  textures.  A  great  advantage  of  the 
ODF  method  of  texture  representation  is  that  it  is  capable  of  giving  an  accurate  representation  of 
the  inverse  pole  figure.  In  this  investigation  the  inverse  pole  figure  will  be  employed  to  quantify 
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the  pole  densities  of  the  <100>,  <1 10>  and  <1 1 1>  poles  in  tantalum  as  a  function  of  plastic  strain 
in  compression. 

The  objective  of  this  investigation  is  to  develop  the  processing  parameters  that  produce 
tantalum  with  improved  performance  in  EFP’s  by  controlling  the  size,  shape  and  crystallographic 
orientation  of  the  metallic  grains.  To  succeed  in  this  endeavor  requires  the  definition  of  the 
relationships  existing  between  the  mechanical  processing  sequence,  the  crystallographic  texture, 
and  the  mechanical  properties  of  the  material.  This  approach  leads  to  improved  capability  of 
controlling  the  mechanical  properties  of  tantalum  that  can  be  exploited  to  enhance  the  design  of 
EFPs. 
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Materials  Processing: 

Powder  Process :  Tantalum  powder  having  the  following  chemistry:  H  28  PPM;  O  182  PPM;  Si 
10  PPM;  Fe  <5  PPM;  Mo  5  PPM;  C  16  PPM  and  N  26  PPM,  was  consolidated  by  Ceracon , 

Inc.  (1)  The  average  particle  size  of  the  powder  was  13  microns  and  the  particle  size  distribution 
of  the  powder  was  100/200  mesh  -  16.9%,  200/325  mesh  -31.3%,  and  -325  mesh  51.6%. 
Approximately  2.04  kg  of  powder  was  formed  into  16.5  mm  (0.65  inches)  thick  and  140  mm  (5.5 
inches)  diameter  near  net  shape  preform  by  cold  isostatic  pressing  of  the  powder  in  an  elastomer 
bag  at  345  MPa.  The  preform  was  hot  isostatically  forged  by  Ceracon,  Inc.  Prior  to  the  final 
forging  the  preform  was  wrapped  in  tantalum  foil  placed  in  a  stainless  steel  container  and  held  for 
24  hours  at  400 °C  under  a  constant  vacuum  of  10'5torr.  The  preform  was  Ceracon™  forged  into 
a  to  a  disk  13  mm  thick  and  150  mm  in  diameter  employing  a  pressure  of  980  MPa,  at  1 186°C 
and  a  dwell  time  of  30  seconds.  The  average  grain  size  in  the  as  received  plate  was  determined  by 
Dr.  Joel  House  to  be  37.5  um  -• 

Cabot  Plate:  A  305  mm  diameter  ingot  was  produced  by  Cabot  Corporation  (Cabot  ingot 
number  50014)  utilizing  a  casting  process  consisting  of  vacuum  arc  remelting  of  a  three  electron 
beam  processed  electrode  ingot  (Cabot  ingot  number  80313).  The  ingot  chemistry  furnished  by 
Cabot  Corp.  was:  H  <5  PPM;  O  45  PPM;  Si  <5  PPM;  Fe  <5  PPM;  Mo  <5  PPM;  C  15  PPM  and 
N  <10  PPM.  The  cast  ingot  was  turned  to  290  mm  diameter  and  side  forged  at  200  °C  to  a  100 
mm  thick  slab  for  rolling.  The  slab  was  machined  clean,  annealed  at  1050°C  and  cold  rolled  to  an 
8  mm  thick  plate  with  the  rolling  direction  perpendicular  to  the  ingot  axis.  The  plate  was  annealed 
at  1050°C  for  one  hour.  The  average  grain  size  determined  from  nine  locations  in  the  plate  was 
67. 1  um.(2)  The  range  of  grain  size  observed  was  between  50.0  and  79. 1  um  with  a  standard 
deviation  of  7.2  um.  (2) 

Experimental  Methods  and  Results: 

The  experimental  techniques  employed  in  this  investigation  can  be  partitioned  into  three  task 
areas:  x-ray  pole  figure  determinations;  quantifacation  of  pole  figure  data  and  mechanical  property 
determination.  These  tasks  were  executed  at  the  Air  Force  Research  Laboratory  at  Eglin  Air 
Force  Base,  FL  (AFRL/MNMW). 

Powder  Process:  Compression  specimen,  7.6  mm  diameter  by  7.6  mm  in  length,  were  machined 
from  the  Ceracon™  forged  plate.  These  specimens  were  produced  with  the  compression  axes  in 
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the  thru  thickness  (CZ)  and  in  the  radial  directions  (CR).  Theses  specimens  were  tested  in 
compression  at  a  nominal  strain  rate  of  0.025/s.  The  compressed  specimens  were  sliced  in  half 
transverse  to  the  compression  axis,  metallographically  prepared  using  standard  practice  (3)  and 
the  prepared  specimens  were  utilized  for  the  determination  of  x-ray  pole  figures. 

£gbot  Plate:  Compression  specimens,  7.6  mm  diameter  by  7.6  mm,  were  machined  from  the 
plate  which  was  processed  from  Cabot  ingot  50014.  Specimens  were  produced  with  compression 
axes  in  the  rolling  (RD)  and  transverse  (TV)  directions  of  the  plate.  (The  rolling  direction  is 
normal  to  the  ingot  axis  and  the  transverse  direction  is  parallel  to  the  ingot  axis.)  These 
specimens  were  tested  in  compression  at  a  nominal  strain  rate  of  0.025/s.  The  compressed 
specimens  were  sliced  in  half  transverse  to  the  compression  axis,  metallographically  prepared  and 
employed  for  x-ray  pole  figure  measurements. 

X-Ray  Pole  Figures:  X-ray  pole  figures  were  determined  using  the  Siemens  D5000 
diffractometer  at  AFRL/MNMW.  The  specimens  were  prepared  for  x-ray.,  analysis  and  the  (110), 
(200)  and  (211)  pole  figures  were  recorded  by  Dr.  Joel  House.  All  measurements  were  made  on 
surfaces  that  had  been  mid-planes  of  the  compression  specimens.  These  surfaces  were  exposed  by 
machining.  The  pole  figure  data  were  converted  to  ODFs  (4)  and  inverse  pole  figures  employing 
both  popLA  (5)  and  Siemens  (6)  software. 

Powder  Process:  Pole  figures  were  determined  on  eight  specimens,  four  of  the  specimen  surfaces 
had  normals  parallel  to  the  thickness  direction  of  the  plate  and  four  of  the  specimen  surfaces  had 
normals  parallel  to  the  radial  direction  of  the  plate.  The  specimen  orientations  along  with  the 
mechanical  testing  parameters  for  the  powder  processed  specimens  are  listed  in  Table  I.  All  eight 
samples  were  sectioned  to  expose  the  mid-plane  of  the  compressed  disk.  The  mid-plane  surface 
was  prepared  for  x-ray  investigation  by  metallographically  polishing  and  etching  (3). 

Cabot  Plate:  Pole  figures  were  determined  on  eleven  specimens,  five  of  the  specimen  surfaces 
had  normals  parallel  to  the  rolling  direction  of  the  plate  and  six  of  the  specimen  surfaces  had 
normals  parallel  to  the  transverse  direction  of  the  plate.  The  specimen  identifications  along  with 
the  mechanical  testing  parameters  for  the  Cabot  plate  specimens  are  listed  in  Table  I.  All  eleven 
samples  were  sectioned  at  the  mid-plane  following  compression  deformation.  The  mid-plane 
surface  was  prepared  for  x-ray  investigation  by  metallographically  polishing  and  etching. (3) 
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Table  I :  Tantalum  Specimens  and  Mechanical  Testing  Parameters 


Specimen 

ID 

Material 

Direction 

Stages 

Lq  Inch. 

Lp Inch. 

Logarithmic 

strain 

CZ-1 

Powder 

Thru  Thick 

1 

0.297 

0.268 

-0.100 

CZ-2 

Powder 

Thru  Thick 

1 

0.297 

0.228 

-0.264 

CZ-3 

Powder 

Thru  Thick 

2 

0.296 

0.157 

-0.634 

CZ-4 

Powder 

Thru  Thick 

3 

0.296 

0.086 

-1.236 

CZ-5 

Powder 

Thru  Thick 

3 

0.300 

0.141 

-0.755 

CZ-6 

Powder 

Thru  Thick 

1 

0.298 

0.225 

-0.281 

CR-2 

Powder 

Radial 

1 

0.299 

.  0.225 

-0.284 

CR-3 

Powder 

Radial 

2 

0.300 

0.158 

-0.641 

CR-4 

Powder 

Radial 

3 

0.299 

0.096 

-1.136 

CR-5 

Powder 

Radial 

3 

0.300 

0.141 

-0.755 

RD-1 

Plate  50014 

Rolling 

1 

0.301 

0.277 

-0.083 

RD-2 

Plate  50014 

Rolling 

1 

0.300 

0.225 

-0.288 

RD-3 

Plate  50014 

Rolling 

2 

0.300 

0.153 

-0.673 

RD-4 

Plate  50014 

Rolling 

3 

0.300 

0.131 

-0.829 

RI)-5 

Plate  50014 

Rolling 

3 

0.300 

0.084 

-1.273 

TV-1 

Plate  50014 

Transverse 

1 

0.300 

0.258 

-0.151 

TV-2 

Plate  50014 

Transverse 

1 

0.298 

0.224 

-0.285 

TV-3 

Plate  50014 

Transverse 

2 

0.300 

0.151 

-0.686 

'  TV-4  y^ 

Plate  50014 

Transverse 

3 

0.301 

0.130 

-0.840  . 

TV-5 

Plate  50014 

Transverse 

3 

0.301 

0.086 

-1.250 

TV-6 

Plate  50014 

Transverse 

1 

0.300 

0.273 

-0.094 
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Texture  Quantifacation: 

The  texture  quantifacation  employed  in  this  investigation  is  an  extension  of  the  method 
reported  by  Hosford  (7)  which  is  based  on  inverse  pole  figures.  This  method  calculates  the 
fraction  of  poles  contained  within  2.5°,  7.5°  and  12.5°  of  the  <1 1 1>  <100>  and  <1 10> 
orientations.  In  this  investigation  the  Hosford  analysis  was  extended  to  calculate  the  fraction  of 
non-oriented  grains  in  a  specimen  and  the  texture  distribution.  Three  mutually  perpendicular 
orientations  of  inverse  pole  figures  are  presented.  One  orientation  is  in  the  direction  of 
compression  axis,  the  second  is  in  a  direction  normal  to  the  compression  axis,  and  the  third  is 
perpendicular  to  the  initial  two  directions. 

The  fraction  of  non-oriented  grains  is  defined  as;  (l-I(F<m>+  F<100>  +  F<110>  )},  where  the 
F<wd>  sre  the  fractions  of  grains  within  the  chosen  angle  of  the  selected  <hkl>  orientation.  In  the 
case  of  an  ideally  random  distribution  of  grain  orientations  the  F<hU>  values  for  an  angle  of  12.5° 
are  0.0965,  0.0715  and  0.1428  for  the  <1 1 1>,  <100>  and  <1 10>  orientations  respectively.  These 
random  values  allow  the  data  from  any  specimen  to  be  expressed  as  a  multiple  of  the  random 

value  (  times  random  )  simply  by  dividing  the  measured  values  by  the  ideally  random  values  given 
above. 

The  texture  distribution  was  also  determined  by  simple  extension  of  these  calculations.  In  the 
case  of  an  ideally  random  distribution  of  grain  orientations  the  F<hkl>  values  for  an  angle  of  7.5° 
are  0.0351,  0.0290  and  0.0576,  and  for  an  angle  of  2. 5°  are  0.0050,  0.0034  and  0.0079  for  the 
<1 1 1>,  <100>  and  <1 10>  orientations  respectively.  Data  from  any  specimen  can  be  normalized 
to  times  random  at  the  2.5  ,  7.5°  and  12.5°  angles  using  the  fraction  of  grains  data  from  the 
ideal  random  grain  orientation  for  these  three  angles.  This  provides  a  method  to  inspect  the 
texture  distribution  about  all  of  the  three  major  pole  orientations;  <1 1 1>,  <100>  and  <1 10>. 
Powder  Process:  The  fraction  of  grains  within  12.5°  of  the  <1 1 1>,  <100>  and  <1 10>  poles  in 
the  thickness  and  radial  directions  of  the  Ceracon™  forged  plate  as  a  function  of  compressive 
strain  are  shown  in  Fig.  1.  During  compression  of  a  body  centered  cubic  (BCC)  metal  the  <1 1 1> 
and  <100>  poles  align  to  the  compression  axis.  As  these  poles  align  the  <1 10>  poles  rotate  away 
from  the  compression  axis.  The  data  in  Fig.  1  indicates  that  the  grain  rotations  occurred  during 
the  initial  0.6  compressive  strain.  At  this  strain  about  80%  of  the  grains  are  oriented  within  12.5° 
of  the  <1 1 1>  and  <100>  poles.  At  larger  strains  the  values  of  the  fraction  of  oriented  grains 
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remains  constant.  The  data  in  Fig.  1  are  normalized  to  random  and  shown  in  Fig.  2.  These  data 
clearly  show  the  similar  behavior  of  the  thickness  and  radial  directions  during  compression  and 
indicate  the  <100>  texture  (>5  times  random)  is  a  little  stronger  than  the  <1 1 1>  texture  (>4  times 
random). 

The  data  in  Fig.  1  were  employed  to  determine  the  fraction  of  non-oriented  grains  as  a 
function  of  compressive  strain  as  are  shown  in  Fig.  3 .  In  an  ideally  random  sample  the  fraction  of 
grains  outside  of  12.5°  of  the  <1 1 1>,  <100>  and  <1 10>  is  0.689.  As  shown  in  Fig.  3  the  fraction 
of  non-oriented  grains  decreases,  for  both  the  thickness  and  radial  specimens,  from  0.6  at  a  strain 
of  zero  to  0.19  at  a  strain  of  0.6.  The  texture  strengths  calculated  for  a  2.5°  angle  encircling  the 
<1 1 1>  <100>  and  <1 10>  poles  for  the  powder  processed  specimens  are  recorded  in  Table  II. 

The  maximum  <1 1 1>  texture  strengths  of  21  and  17  were  observed  in  specimens  CZ-4  and  CR-4. 

The  texture  distribution  in  the  as  forged  tantalum  powder  plate  is  shown  in  Fig.  4.  The 
maximum  strength  of  the  <1 1 1>  and  <100>  texture  components  in  the  thru  thickness  direction  are 
slightly  less  than  2.5.  The  strengths  of  these  components  are  about  equal  with  the  <100>  texture 
being  slightly  stronger.  Weaker  textures  exist  in  the  two  mutually  perpendicular  directions  normal 
to  the  thru  thickness  direction  where  the  <1 10>  has  a  maximum  strength  of  less  than  1.5.  The 
texture  distributions  are  broad  in  all  of  the  three  directions  of  observation.  Only  the  <100>  and 
<1 1 1>  poles  show  a  slight  degree  of  the  texture  sharpening  in  the  thru  thickness  direction.  The 
texture  distribution  in  the  thru  thickness  direction  sharpens  dramatically  after  being  compressed  to 
a  strain  of  1.236,  as  shown  in  Fig.  5.  The  maximum  strengths  of  the  <1 1 1>  and  <100>  textures 
are  now  greater  than  20  times  random  and  show  a  rapid  variation  with  encirclement  angle  about 
these  two  poles.  The  texture  strengths  increase  from  about  5  times  random  at  a  deviation  angle  of 
12.5°  to  12  times  random  at  a  deviation  angle  of  7.5°  and  to  greater  than  20  times  random  at  a 
deviation  angle  of  2.5°.  As  expected  this  rapid  increase  in  texture  with  deviation  angle  in  the 
direction  of  the  compression  axis  is  not  observed  in  the  two  directions  of  observation  normal  to 
the  compression  axis. 
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Table  13:  Texture  Strengths  in  Units  of  Times  Random  for  Compression  Specimens. 

Data  Are  for  2.5°  Encirclement  Angle  Around  the  <Hkl>  Poles  Indicated 


jlSlHHH 

Material 

<111> 

<I00> 

<110> 

|  Nat.  Strain 

As  Forged 

Powder 

Thru  Thick 

2.37 

2.42 

0.5 

0 

CZ-6 

Powder 

Thru  Thick 

7.46 

8.08 

0.16 

-0.281 

CZ-3 

Powder 

Thru  Thick 

14.7 

14.92 

0 

-0.634 

CZ-5 

Powder 

Thru  Thick 

9.91 

13.24 

0 

-0.755 

CZ-4 

Powder 

Thru  Thick 

20.75 

20.23 

0 

-1.236 

CZ-2 

Powder 

Radial 

9.07 

8.58 

0.27 

-0.284 

CR-3 

Powder 

Radial 

13.53 

13.46 

mm 

-0.641 

CR-5 

Powder 

Radial 

7.96 

12.82 

0 

-0.755 

CR-4 

Powder 

Radial 

16.83 

13.39 

0.1 

-1.136 

RD-1 

Plate  50014 

Rolling 

0.71 

2.1 

1.4 

-0.083 

RD-2 

Plate  50014 

Rolling 

4.27 

3.75 

0.48 

-0.288 

RD-3 

Plate  50014 

Rolling 

10.93 

5.98 

0.42 

-0.673 

RD-4 

Plate  50014 

Rolling 

10.5 

7.49 

0.18 

-0.829 

RD-5 

Plate  50014 

Rolling 

8.72 

9.71 

0.17 

-1.273 

TV-6 

Plate  50014 

Transverse 

1.4 

2.53 

1.15 

-0.094 

TV-1 

Plate  50014 

Transverse 

0.41 

2.51 

1.44 

-0.151 

TV-2 

Plate  50014 

Transverse 

2.98 

3.72 

0.49 

-0.285 

TV-3 

Plate  50014 

Transverse 

7.36 

6.55 

0.04 

-0.686 

TV-4 

Plate  50014 

Transverse 

7.56 

6.72 

0.19 

-0.84 

TV-5 

Plate  50014 

Transverse 

12.22 

6.57 

0.16 

-1.25 
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The  texture  distribution  in  the  radial  directions  sharpen  to  a  lesser  degree  than  the  thru 
thickness  direction  during  compression  to  a  strain  of  1 . 136,  as  shown  in  Fig.  6.  The  maximum 
strengths  of  the  <1 1 1>  and  <100>  textures  are  about  15  times  random  and  show  a  rapid  variation 
with  encirclement  angle  about  these  two  poles.  The  texture  strengths  increase  from  about  5  times 
random  at  a  deviation  angle  of  12.5°  to  10  times  random  at  a  deviation  angle  of  7.5°  and  to  15 
times  random  at  a  deviation  angle  of  2.5°.  As  in  the  case  of  the  of  thru  thickness  compression, 
the  rapid  increase  in  texture  with  deviation  angle  in  the  direction  of  the  radial  compression  axis  is 
present  to  a  lesser  degree  in  the  two  directions  of  observation  normal  to  the  compression  axis. 
Cabot  Plate:  The  fraction  of  grains  within  12.5°  of  the  <1 1 1>,  <100>  and  <11 0>  poles  in  the 
rolling  and  transverse  directions  of  the  rolled  Cabot  plate  50014  as  a  function  of  compressive 
strain  are  shown  in  Fig.  7.  Grain  rotation  in  compression  of  a  BCC  metal  aligns  the  <1 1 1>  and 
<100>  poles  to  the  compression  axis.  As  these  poles  align  the  <1 10>  poles  rotate  away  from  the 
compression  axis.  The  data  in  Fig.  7  indicates  that  the  grain  rotations  occurred  during  the  initial 
0.7  compressive  strain.  At  this  strain  about  70%  of  the  grains  are  oriented  within  12.5°  of  the 
<1 1 1>  and  <100>  poles.  The  data  contained  in  Fig.  7  were  normalized  to  random  and  shown  in 
Fig.  8.  Now  the  different  behavior  of  specimens  taken  from  the  rolling  and  transverse  directions 
during  compression  is  apparent.  The  texture  of  the  specimens  with  compression  axes  in  the  rolling 
direction  was  consistently  stronger  than  the  texture  in  the  specimens  having  compression  axes 
transverse  to  the  rolling  direction.  The  strength  of  the  <1 1 1>  texture  was  generally  between  4 
and  5  times  random  which  was  a  little  stronger  than  the  <100>  texture  (between  3  and  4  times 
random).  The  data  in  Fig.  7  are  used  to  calculate  the  fraction  of  non-oriented  grains  as  a  function 
of  compressive  strain  and  shown  in  Fig.  9.  In  an  ideal  random  sample  the  fraction  of  grains 
outside  of  12.5°  of  the  <11 1>  <100>  and  <1 10>  is  0.689.  As  seen  in  Fig.  9  the  fraction  of  non- 
oriented  grains  decreases  from  0.65  ,  almost  random,  at  a  strain  of  about  0.1  to  a  value  of  0.21  at 
a  strain  of  1.2.  The  data  from  the  rolling  and  transverse  specimens  were  significantly  different. 
There  is  a  more  rapid  grain  rotation  occurring  in  the  specimens  having  compression  axes  parallel 
to  the  rolling  direction,  see  Fig.  9. 

The  texture  strengths  for  2.5°  encirclement  angle  around  the  <1 1 1>,  <100>  and  <1 10>  poles 
for  the  Cabot  plate  specimens  are  also  contained  in  Table  D.  The  maximum  <1 1 1>  texture 
strengths  of  1 1  and  12  were  observed  in  specimens  RD-3  and  TV-5. 
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The  texture  distribution  in  the  specimen  cut  with  the  compression  axis  parallel  to  the  rolling 
direction  of  the  tantalum  plate  after  compression  to  a  strain  of  1.257  is  shown  in  Fig.  10.  The 
maximum  strength  of  the  texture  in  the  direction  of  compression  is  between  1 0  and  1 1  times 
random.  The  strengths  of  the  <1 1 1>  and  <100>  textures  are  equal  and  their  distributions  are  the 
same  and  show  no  indication  of  being  stronger  nearer  the  <hkl>  pole.  The  texture  distributions 
existing  in  the  two  perpendicular  directions  normal  to  the  compression  axis  are  identical.  This 
indicates  the  formation  of  a  well  developed  fiber  texture.  The  texture  distribution  in  a  specimen 
cut  with  the  compression  axis  parallel  to  the  transverse  direction  of  the  plate  after  compression  to 
a  strain  of  1.250  is  shown  in  Fig.  11.  The  <1 1 1>  texture  distribution  in  the  direction  of 
compression  is  sharper  than  that  observed  in  the  specimen  having  the  compression  axis  parallel  to 
the  rolling  direction;  see  Fig.  10.  However,  the  <100>  texture  component  is  weaker  and  less 
sharp  than  that  observed  in  the  specimen  having  a  compression  axis  parallel  to  the  rolling 
direction.  The  texture  distributions  existing  in  the  two  perpendicular  directions  normal  to  the 
compression  axis  are  considerably  different  in  texture  strengths.  This  not  only  indicates  the  lack 
of  formation  of  a  uniform  fiber  texture  but  may  signify  the  existence  of  variation  in  mechanical 
response  in  the  two  perpendicular  directions. 

Mechanical  Properties: 

Compression  tests  at  a  initial  strain  rate  of  0.025/s  were  performed  using  an  Instron  testing 
machine  having  a  50,000  load  cell.  The  mechanical  testing  parameters  are  listed  in  Table  I.  The 
compression  tests  were  done  in  stages,  with  each  stage  usually  consisting  of  approximately  an 
engineering  strain  of  O.j.  The  number  of  stages  and  the  total  logarithmic  strains  for  each  specimen 
are  in  Table  I.  Between  each  stage  of  the  compression  deformation  each  specimen  was 
remachined  to  a  circular  cross-section  .  The  load-displacement  data  from  each  stage  was 
transformed  to  true  stress  and  logarithmic  strain.  The  elastic  strain  portion  of  the  stress  strain 
curve  was  subtracted  from  the  data  to  demonstrate  the  data  as  true  stress  versus  logarithmic 
plastic  strain  curves. 

Powder  Process:  Typical  compression  stress  versus  strain  curves  for  the  three  stage  specimen 
CZ-4  and  CR-4  are  shown  in  Fig.  12  and  Fig.  13.  These  figures  contain  graphs  of  engineering 
stress  versus  engineering  strain,  true  stress  versus  logarithmic  strain  and  true  stress  versus 
logarithmic  plastic  strain.  The  mechanical  behavior  of  these  two  specimens  was  almost  identical 
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with  both  showing  a  yield  point  in  the  second  stage  of  deformation;  however  only  CR-4  showed  a 
yield  point  in  the  first  stage  of  compression.  The  values  of  the  upper  yield  points,  lower  yield 
points  and  true  flow  stress  at  logarithmic  strains  of  0. 1,  0.2  and  0.3  for  all  powder  specimens  are 
listed  in  Table  HI.  The  sample  to  sample  variations  are  less  than  ±10  MPa.  However,  the 
specimens  cut  from  the  radial  direction  demonstrate  flow  stress  that  are  approximately  20  MPa 
higher  than  the  specimens  compressed  in  the  thru  thickness  direction.  The  yield  stress  is  about 
500  MPa  and  the  flow  stress  at  a  strain  of  0.3  is  700  MPa.  These  specimens  deformed 
symmetrically  and  did  not  exhibit  any  eccentricity, 

Cabot  Plate: 

Typical  compression  stress  versus  strain  curves  for  the  three  stage  specimen  RD-5  and  TV-5 
are  shown  in  Fig.  14  and  Fig.  15  respectively.  These  figures  contain  graphs  of  engineering  stress 
versus  engineering  strain,  true  stress  versus  logarithmic  strain  and  true  stress  versus  logarithmic 
plastic  strain.  The  mechanical  behavior  of  these  two  specimens  was  almost  identical  with  both 
showing  a  yield  point;  however  the  magnitude  of  the  yield  point  in  specimen  TV-5  was  greater  as 
can  be  seen  from  the  data  in  Table  IV.  Also  included  in  this  table  are  the  values  of  the  upper  yield 
points,  lower  yield  points  and  true  flow  stress  at  logarithmic  strains  of  0. 1,  0.2  and  0.3  for  all  plate 
specimens.  The  sample  to  sample  variations  are  less  than  ±6  MPa;  however,  the  specimens  with 
compression  axis  in  the  transverse  direction  demonstrate  flow  stress  that  are  approximately  1 0 
MPa  higher  than  the  specimens  compressed  in  the  rolling  direction.  The  yield  stress  is  about  260 
MPa  and  the  flow  stress  at  a  strain  of  0.3  is  370  MPa.  The  samples  deformed  in  an  non-circular 
fashion  during  each  stage  of  compressive  deformation.  The  degree  of  eccentricity  was  calculated 
using,  e  =  {  1  -  (b/a)2  }Vl where  b  and  a  are  the  minimum  and  maximums  in  diameter  observed.  The 
eccentrics  for  the  specimens  are  shown  in  Fig.  16.  The  specimens  tended  to  show  eccentricities  of 
0.3  except  in  the  last  two  stages  of  deformation  for  RD-5  when  the  eccentricities  were  0.6. 
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Table  III :  Yield  and  Flow  Stresses  of  Tantalum  Specimens  Produced  From  Powder. 


Specimen; 

ID. 

;  Direction 

Upper 

YP  MPa 

Lower 

YP  MPa 

Flow  Stress 

Strain  =  0.1 

Flow  Stress; 

Strain  -  0.2 

Flow  Stress 

Strain  =  0.3 

!■  CZ-1 

Thru  Thick 

495 

471 

559 

NA 

NA 

;  CZ-2 

Thru  Thick 

488 

473 

555 

651 

661 

CZ-3 

|  Thru  Thick 

504 

476 

573 

657 

706 

CZ-4 

Thru  Thick 

NA 

475 

577 

663 

710 

CZ-5 

Thru  Thick 

522 

483 

581 

667 

717 

CZ-2 

Radial 

520 

484 

592 

668 

716 

CR-3 

Radial 

510 

490 

588 

673 

721 

CR-4 

Radial 

495 

490 

586 

671 

718 

CR-5 

Radial 

516 

489 

582 

668 

716 

Table  IV  :  ^ 

Teld  and  Flow  Stresses  of  Tantalum  Specimens  Produced  From  Cabot  Plate  50014 

Specimen 

ID: 

Direction 

I  - 

Upper 

YP  MPa 

Lower 

YPMPa 

I  Flow  Stress 

Strain  -  0  1 

Flow  Stress 

Strain  =  0.2 

Flow  Stress 

Strain  =  0.3 

RD-l 

Rolling 

278 

252 

297 

NA 

NA 

RD-2 

Rolling 

262 

260 

309 

342 

373 

RD-3 

Rolling 

268 

263 

310 

345 

378 

RD-4 

Rolling 

265 

249 

303 

340 

371 

RD-5 

Rolling 

256 

251 

299 

334 

367 

TV-1  \ 

Transverse 

NA 

270 

318 

NA 

NA 

TV-2 

Transverse 

279 

266 

316 

350 

380 

TV-3 

Transverse 

261 

259 

313 

351 

382 

TV-4 

Transverse 

283 

256 

307 

346 

379 

TV -5 

Transverse 

289 

256 

310 

350 

384 

TV-6 

Transverse 

284 

259 

312 

NA 

NA 
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Discussion: 

Crystallographic  texture  formation  during  compression  was  significantly  different  in  the 
powder  and  plate  materials.  The  powdered  tantalum  formed  strong  <1 1 1>  and  <100>  textures  at 
about  the  same  rate  with  compressive  strain,  see  Fig.  1 .  In  the  case  of  the  plate  material  the 
<1 1 1>  texture  formed  much  faster  with  compressive  strain  than  did  the  <100>,  see  Fig.  7.  At 
high  degrees  of  strain  the  fraction  of  grains  that  are  within  12.5 0  of  the  <1 1 1>  are  about  1 .5  time 
the  grain  fraction  that  are  within  12.5°  of  the  <100>.  Based  on  multiplicity  the  ratio  of  <1 1 1> 
grains  to  <100>  grains  would  be  1.33.  The  result  indicates  the  deformation  occurred  in  an 
uninhibited  fashion.  That  is  there  was  no  inhibition  of  the  cross  slip  processes  and  the  number 
grains  that  rotated  to  each  of  the  two  stable  orientation  were  simply  proportional  to  the  number  of 
each  type  available.  The  high  interstitial  content  in  the  powder  processed  material  would  be 
expected  to  have  a  significant  effect  on  the  cross-slip  behavior  during  deformation.  If  cross  slip  is 
inhibited  then  <1 1 1>  slip  on  a  { 1 10}  plane  has  equal  probability  deforming  toward  a  <1 1 1>  or  a 
<100>  pole  producing  the  texture  demonstrated  by  the  powder  processed  material.  The  <100> 
and  <1 1 1>  grain  fractions  are  equal  at  high  degrees  of  strain,  see  Fig.  7. 

Chemistry  differences  between  the  powder  processed  and  wrought  material  accounted  for 
large  differences  in  the  yield  and  flow  stresses  of  the  tantalum.  The  average  values  of  the  upper 
yield  stress,  the  lower  yield  stress  and  the  flow  stresses  at  logarithmic  strains  of  0.1,  0.2  and  0.3 
are  given  in  Table  V.  The  yield  stress  and  flow  stress  values  of  the  Cabot  plate  material  are  about 
50%  lower  than  the  values  observed  for  the  powder  processed  material.  The  effects  of  the 
processing  on  the  mechanical  properties  of  the  tantalum  can  be  seen  by  comparing  the  data  from 
the  two  directions  tested  for  each  material.  These  differences  are  due  to  a  combination  of  effects 
including  grain  shape,  grain  morphology  and  grain  orientation  or  texture.  The  stress  values  in  the 
power  processed  material  are  greater  in  the  radial  direction  by  about  15  MPa  or  about  2%.  The 
Cabot  plate  material  has  higher  strength  and  flow  stress  values  in  the  specimens  having  a 
compression  axis  in  the  transverse  direction.  The  flow  stress  values  in  the  transverse  specimens 
are  9  MPa  or  2.5%  greater  than  the  specimens  with  their  compression  axes  in  the  rolling  direction. 
It  would  be  of  value  to  determine  the  contributions  of  the  various  microstructural  characteristics 
to  the  flow  stress;  however  the  data  sets  are  not  sufficiently  extensive  to  make  the  necessary 
correlations. 
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Table  V:  Average  of  the  Yield  and  Flow  Stresses  of  Tantalum  Specimens  . 


Specimen 

ID: 

Direction 

Avg.  Upper 

YP  MPa 

Avg.  Lower 

YP  MPa 

Flow  Stress 

Strain  =  0.1 

Row  Stress 

Strain  -  0.2 

Flow  Stress 

Strain  =  0.3 

CZ 

Thru  Thick 

495.6 

475.6 

569.0 

659.5 

704.7 

•••• 

Radial 

512.6 

488.2 

587.0 

670.0 

717.8 

CZ  +  CR 

Both 

506.6 

481.2 

577.0 

664.7 

708.1 

RD 

Rolling 

265.8 

255.0 

303.6 

340.2 

372.3 

::  TV 

Transverse 

279.2 

261.0 

312.7 

349.3 

381.3 

RD  +  TV 

Both 

272.5 

258.3 

308.5 

344.8 

376.8 

Table  VI :  Standard  Deviations  of  the  Yield  and  Flow  Stresses  of  Tantalum  Specimens 


Specimen 

ID: 

Direction 

Stand;  Dev 

UYPMPa 

Stand.  Dev 

LYP  MPa 

Stand.  Dev. 

Strain  0.1 

Stand  Dev 

Strain  0.2 

Stand.  Dev 

Strain  =  0.3 

CZ 

Thru  Thick 

8.02 

4.50 

11.4 

7.0 

15.4 

CR 

Radial 

10.8 

2.90 

4.16 

2.45 

2.36 

CZ  +  CR 

Both 

12.4 

7.60 

12.7 

7.42 

17.6 

RD 

Rolling 

8.13 

6.12 

5.81 

4.64 

4.57 

TV 

Transverse 

10.8 

5.72 

3.98 

2.22 

2.22 

RD  +  TV 

Both 

11.4 

6.42 

6.62 

5.87 

5.85 

The  standard  deviations  in  the  yield  stress  and  flow  stress  data  is  presented  in  Table  VI. 

These  values  indicate  that  the  differences  observed  in  the  stress  values  are  real.  For  example  the 
differences  between  the  flow  stress  in  specimens  with  compression  axes  parallel  and  transverse  to 
the  rolling  direction  in  the  plate  material  is  9  MPa  while  the  standard  deviations  for  all  of  the  data 
from  these  two  directions  have  a  maximum  of  6.62. 
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Fig.  1  Fraction  of  Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Powder  Processed 
Tantalum  with  the  Compression  Axes  in  the  Thickness  and  Radial  Directions  for  the  Pole 
Orientations  Indicated:  ▲  -  <1 1 1>  ■  -  <100>  and  •  -  <1 10>. 

Fig-  2  The  Texture  Coefficients  as  a  Function  of  Compressive  Strain  for  Powder  Processed 
Tantalum  with  the  Compression  Axes  in  the  Thickness  and  Radial  Directions  for  the  Pole 
Orientations  Indicated:  ▲  -  <1 1 1>,  ■  -  <100>  and  •  -  <1 10>. 

Fig.  3  Fraction  of  Non  -Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Powder 
Processed  Tantalum. 

Fig.  4  The  Distribution  of  the  Texture  Coefficients  for  the  As  Forged  Powder  as  a  Function  of 
the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression,  b)  In  a 
Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The  Compression 
Axis  and  Perpendicular  to  the  Direction  in  (b).  .  •  .  .. 

Fig.  5  The  Distribution  of  the  Texture  Coefficients  for  CZ-4,  a  Thru  Thickness  Specimen,  as  a 
Function  of  the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression, 
b)  In  a  Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The 
Compression  Axis  and  Perpendicular  to  the  Direction  in  (b). 


Fig.  6  The  Distribution  of  the  Texture  Coefficients  for  CR-4,  a  Radial  Specimen,  as  a  Function 
of  the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression,  b)  In  a 
Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The  Compression 
Axis  and  Perpendicular  to  the  Direction  in  (b). 


Fig.  7  Fraction  of  Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Cabot  Plate 
Tantalum  with  the  Compression  Axes  in  the  Rolling  and  Transverse  Directions  for  the  Pole 
Orientations  Indicated:  ▲  -  <1 1 1>  ■  -  <100>  and  •  -  <1 10>. 


Fig.  8  The  Texture  Coefficients  as  a  Function  of  Compressive  Strain  for  Cabot  Plate  Tantalum 
with  the  Compression  Axes  in  the  Rolling  and  Transverse  Directions  for  the  Pole  Orientations 
Indicated:  ▲  -  <1 1 1>,  ■  -  <100>  and  •  -  <1 10>. 

Fig.  9  Fraction  of  Non  -Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Cabot  Plate 
Tantalum  for  the  Specimen  Orientations  Indicated. 
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Fig.  10  The  Distribution  of  the  Texture  Coefficients  for  RD-5,  a  Rolling  Direction  Specimen,  as 
a  Function  of  the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression, 
b)  In  a  Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The 
Compression  Axis  and  Perpendicular  to  the  Direction  in  (b). 

Fig.  1 1  The  Distribution  of  the  Texture  Coefficients  for  TV-5,  a  Transverse  Specimen,  as  a 
Function  of  the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression, 
b)  In  a  Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The 
Compression  Axis  and  Perpendicular  to  the  Direction  in  (b). 

Fig.  12  The  Compression  Stress  Versus  Strain  Curves  for  Thru  Thickness  Specimen  CZ-4.  a) 
Engineering  Stress  -  Engineering  Strain,  b)  True  Stress  -  Logarithmic  Strain,  and  c)  True  stress  - 
Logarithmic  Plastic  Strain. 

Fig.  13  The  Compression  Stress  Versus  Strain  Curves  for  Radial  Specimen  CR-4.  a)  Engineering 
Stress  -  Engineering  Strain,  b)  True  Stress  -  Logarithmic  Strain,  and  c)  True  stress  -  Logarithmic 
Plastic  Strain. 

Fig.  14  The  Compression  Stress  Versus  Strain  Curves  for  Rolling  Direction  SpecimeaRD-5.  a) 
Engineering  Stress  -  Engineering  Strain,  b)  True  Stress  -  Logarithmic  Strain,  and  c)  True  stress  - 
Logarithmic  Plastic  Strain. 

Fig.  15  The  Compression  Stress  Versus  Strain  Curves  for  Transverse  Direction  Specimen  TV-5, 
a)  Engineering  Stress  -  Engineering  Strain,  b)  True  Stress  -  Logarithmic  Strain,  and  c)  True  stress 
-  Logarithmic  Plastic  Strain. 

Fig.  16  Eccentricity  at  Each  Stage  of  Deformation  of  Specimens  Taken  from  the  Cabot  Plate:  (a) 
With  Compression  Axes  Oriented  in  the  Rolling  Direction  and  (b)  With  Compression  Axes 
Oriented  in  the  Transverse  Direction. 


7-20 


Fraction  of  Oriented  Grains 

Powdered  Tantalum 


SUIEJ0  jO  UOjJOBJ  J 


7-21 


Fig.  1  Fraction  of  Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Powder  Processed 
Tantalum  with  the  Compression  Axes  in  the  Thickness  and  Radial  Directions  for  the  Pole 
Orientations  Indicated:  A  -  <1 1 1>,  ■  -  <100>  and  •  -  <1 10>. 
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Fig.  3  Fraction  of  Non  -Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Powder 
Processed  Tantalum. 
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Fig.  4  The  Distribution  of  the  Texture  Coefficients  for  the  As  Forged  Powder  as  a  Function  of 
the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression,  b)  In  a 
Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The  Compression 
Axis  and  Perpendicular  to  the  Direction  in  (b). 
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bHn 1 n-  f ,  ^-S  2  E"clrcImS the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compressk 
b)  In  a  Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Nornird  to  The 
Compression  Aus  and  Perpendicular  to  the  Direction  in  (b) 
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Fig.  6  The  Distribution  of  the  Texture  Coefficients  for  CR-4,  a  Radial  Specimen,  as  a  Function 
of  the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression,  b)  In  a 
Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The  Compression 
Axis  and  Perpendicular  to  the  Direction  in  (b). 
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Fig.  7  Fraction  of  Oriented  Grains  as  a  Function  of  Compressive  Strain  for  Cabot  Plate 
Tantalum  with  the  Compression  Axes  in  the  Rolling  and  Transverse  Directions  for  the  Pole 
Orientations  Indicated:  A  -  <1 1 1>,  ■  -  <I00>  and  •  -  <1 10>. 
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Fig.  8  Tlie  Texture  Coellicients  as  a  Function  of  Compressive  Strain  for  Cabot  Plate  Tantalum 
with  the  Compression  Axes  in  the  Rolling  and  Transverse  Directions  for  the  Pole  Orientations 
Indicated:  ▲  -  <1 1 1>,  ■  -  <100>  and  •  -  <1 10>. 
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Fig.  10  The  Distribution  of  the  Texture  Coefficients  for  RD-5,  a  Rolling  Direction  Specimen,  as 
a  Function  of  the  Angle  Encircling  the  <hkl>  Poles  Indicated,  a)  In  the  Direction  of  Compression, 
b)  In  a  Direction  Normal  to  the  Compression  Axis,  and  c)  In  a  Direction  Normal  to  The 
Compression  Axis  and  Perpendicular  to  the  Direction  in  (b). 
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Fig.  11  The  Distribution  of  the  Texture  Coefficients  fhr  tv  ^  „  t  c 

Function  of  the  Anele  Encircling  <hiYYl,  rJ  TY~5’  a  Transverse  Specimen,  as  a 

b)  In  a  Direction  Normal  to  the r  ^  •  P°  eS  Indicated-  a) In  Direction  of  Compression, 

Comnres  ion  th'.C°mPression  A™,  and  c)  In  a  Direction  Normal  to  The 

compression  Axis  and  Perpendicular  to  the  Direction  in  (b). 
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Fig.  14  The  Compression  Stress  Versus  Strain  Curves  for  Rolling  Direction  Specimen  RD-5.  a) 
Engineering  Stress  -  Engineering  Strain,  b)  True  Stress  -  Logarithmic  Strain,  and  c)  True  stress 
Logarithmic  Plastic  Strain. 
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Fig.  15  The  Compression  Stress  Versus  Strain  Curves  for  Transverse  Direction  Specimen  TV-5 
a)  Engineering  Stress  -  Engineering  Strain,  b)  True  Stress  -  Losarithmic  Strain,  and  c)  True  stress 
-  Logarithmic  Plastic  Strain. 
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Fig.  16  Eccentricity  at  Each  Stage  of  Deformation  of  Specimens  Taken  from  the  Cabot  Plate:  (a) 
With  Compression  Axes  Oriented  in  the  Rolling  Direction  and  (b)  With  Compression  Axes 
Oriented  in  the  Transverse  Direction. 
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THE  STUDY  OF  A  TRANSVERSE  JET  IN  A  SUPERSONIC 
CROSS-FLOW  USING  ADVANCED  DIAGNOSTICS 


Gregory  S.  Elliott 
Assistant  Professor 

Department  of  Mechanical  and  Aerospace  Engineering 
Rutgers  University 

Abstract 

Transverse  injection  of  multi-hole  sonic  injectors  into  a  Mach  3  flow  has  been  experimentally 
investigated  using  optimization  techniques.  The  technique  allows  for  the  optimization  of  the 
positions  of  multiple  injectors  based  on  the  efficiency  of  the  mixing  as  defined  by  the  mixing  area, 
penetration  depth,  and  growth  rate  of  the  jet  at  three  locations  downstream  from  the  injector  center. 
These  quantities  were  obtained  by  passing  a  spanwise  sheet  of  light  from  the  second  harmonic  of  a 
Nd:  YAG  pulsed  laser  through  the  flow  which  illuminated  the  jet  mixing  seeded  with  ethanol.  The 
optimization  was  obtained  using  Simulated  Annealing  with  Iterative  Improvement  which  was  able 
to  improve  the  mixing  compared  to  the  single  jet  with  the  same  total  exit  area.  Optimized 
geometries  were  found  when  “best”  mixing  was  defined  by  the  mixing  area  and  penetration  depth 
improving  the  base  line  case  by  82%  and  8%  respectively  for  the  two  injector  geometry.  For  the 
three  injector  geometry,  the  improvement  was  66%  for  mixing  area,  2%  for  penetration,  and  12  % 
for  growthrate.  Also,  a  figure  of  merit  combining  all  mixing  definitions  was  defined  and  optimized 
showing  an  improvement  of  43%  and  9%  for  the  two  and  three  injector  geometries,  respectively. 
Further  research  beyond  the  scope  of  this  preliminary  investigation  will  involve  studying  other 
optimization  routines  applied  to  experimental  analysis  and  experiments  which  would  explain  the 
improvements  in  mixing  given  above. 
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THE  STUDY  OF  A  TRANSVERSE  JET  IN  A  SUPERSONIC 
CROSS-FLOW  USING  ADVANCED  DIAGNOSTICS 


Gregory  S.  Elliott 


Introduction 

Compressible  mixing  layers  have  been  the  subject  of  much  interest  motivated  by  applications 
such  as  supersonic  air-breathing  propulsion  systems  and  reduced  heat  signatures  of  high-speed 
aircraft.  However,  supersonic  mixing  is  not  as  effective  as  their  subsonic  counterparts  to 
compressibility  effects.  The  transverse  jet  in  a  supersonic  cross  flow  (illustrated  in  Figure  1)  has 
been  studied  in  the  past,  motivated  by  its  application  in  several  proposed  supersonic  propulsion 
systems,  in  which  the  qualitative  flow  features  such  as  the  three-dimensional  bow  shock,  separation 
shock,  jet  shock  structure,  and  the  upstream  and  downstream  separation  zones  were  observed. 
Beyond  this,  recent  experiments  have  been  conducted  to  quantify  the  mixing  and  penetration  of  the 
jet  [Lee  etal.,1992;  Hermanson  and  Winter,  1993;  and  Gruber  etal.,  1997].  Schetz  and  Billig  [1966] 
found  that  the  penetration  could  be  parameterized  by  the  jet-to-free  stream  momentum  flux,  given 
by: 

(Y  PM2) 

j  =  - -!  (i) 

(Y  PM2)e 

where  y  is  the  specific  heat  ratio,  P  is  the  static  pressure,  M  is  the  Mach  number,  and  the  subscripts 
j  and  e  indicate  the  jet  and  free  stream  quantities,  respectively.  A  variety  of  geometries  have  been 
investigated  to  enhance  the  mixing  characteristics  of  the  transverse  jet,  including  swept  ramps 
[Hartfield  et  ah,  1994  and  Donohue  et  ah,  1994],  multiple  ports  [Cox  et  ah,  1994  and  Fuller  et  ah, 
1996],  inclined  injection  [McCann  and  Bowersox,  1996],  and  circular  and  non-circular  jet 
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geometries  [Gruber  et  al.,  1995].  It  was  observed  that  these  mixing  enhancement  schemes  could  lead 
to  greater  mixing,  but  often  times  at  the  cost  of  higher  stagnation  pressure  losses. 

There  are  two  goals  of  the  current  study.  First  is  to  investigate  the  best  placement  of  multiple 
injectors  to  obtain  optimum  mixing  with  the  ffeestream.  Although  this  problem  could  be  studied 
computationally,  the  flowfield  from  multiple  injectors  presents  a  significant  problem  due  to  the 
highly  three  dimensional  nature  of  the  flow  and  the  need  to  have  a  grid  which  can  resolve  the 
significant  gradients  of  the  complex  interaction  between  the  shocks  and  multiple  jets.  Because  of 
these  computational  limitations,  and  the  availability  of  the  supersonic  windtunnel  at  Rutgers  the 
current  study  focuses  on  investigating  this  problem  experimentally.  This  leads  to  the  second  goal 
of  the  study  which  is  to  create  an  experimental  optimization  methodology  incorporating  image 
processing  to  evaluate  the  mixedness,  rapid  prototyping  to  manufacture  the  models,  and  optimization 
techniques  to  suggest  new  geometries  to  evaluate.  Not  only  does  this  reduce  the  number  of 
experiments  needed  to  find  the  best  geometry  for  this  study,  but  will  create  the  infrastructure  for 
future  studies  of  other  flowfields.  Two  relatively  simple  geometries  were  investigated  in  this  study: 
the  interaction  of  two  circular  jets  with  a  Mach  3  free  stream  and  the  interaction  of  three  circular  jets 
with  a  Mach  3  free  stream.  The  design  variables  were  the  relative  positions  of  the  circular  injectors 
for  the  jets  with  the  goal  of  optimizing  the  mixing  of  the  jet.  Following  is  a  description  of  the 
optimization  routine  used  in  the  study,  a  description  of  the  experiments  conducted,  and  discussion 
of  the  results. 

Optimization  Methodology 

The  first  step  in  the  experimental  optimization  of  the  mixing  of  multiple  transverse  jet 
injection  is  to  select  the  optimization  method  to  be  used  to  explore  the  design  space  (i.e.  placement 
of  the  injectors),  evaluate  the  figure  of  merit  (FOM,  i.e.  mixing),  and  propose  new  geometries.  Once 
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the  optimization  method  is  found,  this  will  help  guide  how  initial  geometries  are  selected,  the  mixing 
definition  are  evaluated,  new  geometries  selected,  and  models  manufactured. 

Within  the  field  of  fluid  dynamics  there  has  been  much  interest  in  using  optimization 
techniques  particularly  in  the  design  of  aerodynamic  structures.  For  example,  shape  optimizations 
have  been  performed  to  redesign  hypersonic  inlets  to  cancel  cowl  shocks  and  cowl-generated 
compression  [Shukla  et  al.,  1996],  to  minimize  the  peak  inlet  Mach  number  [  Reddy  and  Reddy 
,1995],  and  to  minimize  total  pressure  loss  [Munipally  et  al.,  1995],  In  general  these  studies  are 
computational,  using  routines  which  can  be  evaluated  several  thousands  of  times  with  modest 
computer  resources.  As  mentioned  previously  the  current  use  of  a  computational  model  in  the 
current  study  is  prohibitive  and  therefore  the  optimization  method  must  reflect  the  constraints  of  the 
experimental  procedure  (i.e.  a  limited  number  of  experiments  which  can  be  performed). 

Various  optimization  algorithms  have  been  used  in  the  past.  The  simplest  type  are  gradient 
based  methods  where  the  differences  in  the  value  of  the  objective  parameter  over  its  domain  are  used 
to  calculate  the  slope  and  point  to  the  best  location.  This  method  has  the  advantage  of  simplicity 
and  can  always  lead  to  a  better  solution,  but  has  the  disadvantage  of  often  requiring  many  iterations 
to  converge  on  the  optimum  and  a  difficulty  of  knowing  before  hand  what  the  step  size  should  be. 
In  addition,  gradient  based  schemes  can  sometimes  become  trapped  on  a  portion  of  the  domain  and 
converge  on  a  globally  non-optimum  solution.  Genetic  algorithms  attempt  to  mimic  the  process  of 
natural  selection  by  starting  with  a  collection  of  possible  solutions.  At  each  iteration,  through 
processes  of  recombination  and  mutation  better  solutions  are  formed  until  the  optimal  solution  is 
achieved.  Genetic  algorithms  have  the  advantage  of  working  well  on  complicated  domains  and 
being  able  to  handle  many  types  of  optimization  functions  (discrete,  continuous,  mixed)  but  have 
the  disadvantage  of  requiring  large  data  sets.  Random  probe  methods  have  also  been  used  for 
optimization.  Here  a  function  representing  the  objective  parameter  is  randomly  probed  over  its 
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domain  in  order  to  locate  possible  optimized  regions.  At  each  iteration  the  domain  is  reduced  to 
those  areas  where  local  optimum  solutions  were  found.  The  disadvantage  of  this  method  is  also  the 
need  for  large  data  sets. 

At  first  a  gradient  based  method  was  considered  to  optimize  a  function  which  exactly  fit  the 
data  and  related  the  objective  to  the  hole  locations.  Once  the  initial  sets  of  data  are  taken,  the  design 
parameters  and  objective  function  data  could  be  substituted  into  the  function  and  coefficients  are 
solved.  In  order  for  the  solution  to  exist,  the  number  of  unknown  coefficients  and  data  sets  must  be 
equal.  The  problem  can  then  be  expressed  in  matrix  form  as  : 

[l,x,x2, x3, ...]  [A]  =  [F]  (2) 

where  the  first  matrix  represents  polynomial  terms  of  hole  positions,  [A]  is  the  coefficient  matrix 
to  be  solved  for,  and  [F]  is  the  objective  function  vector.  These  steps  can  be  performed  by  a  Matlab 
program  which  prompts  the  user  to  enter  the  quantity  to  be  optimized  (i.e.  mixedness  definition). 
It  then  optimizes  the  function  subject  to  design  constraints  starting  from  an  initial  guess.  This  initial 
guess  is  varied  over  some  possible  values  for  positions  of  the  injectors.  The  program  used  a 
Sequential  Quadratic  Programing  method  in  which  a  subproblem  obtained  by  linearizing  the 
nonlinear  constraints  is  solved  at  each  iteration.  The  routine  produces  a  series  of  optimized 
geometries  and  their  corresponding  objective  function  values.  These  new  geometries  could  be 
fabricated  and  tested  to  see  how  well  they  agree  with  the  predicted  values  and  also  provide  additional 
data  points  for  a  more  refined  optimization  search  (i.e.  an  objective  function  with  more  terms).  This 
procedure  can  then  be  repeated  until  the  results  converge  to  within  some  tolerance. 

One  disadvantage  of  this  method  lies  in  trying  to  match  the  data  exactly.  In  doing  so,  the 
objective  function  is  often  required  to  make  large  deviations  from  the  general  pattern  produced  by 
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the  data  sets  giving  unphysical  function  values.  In  order  to  avoid  this,  a  regression  analysis  was  used 
instead  of  a  perfectly  fit  function.  This  requires  more  data  for  a  polynomial  of  the  same  order,  but 
smooths  for  experimental  inaccuracies.  Two  criteria  were  used  to  determine  the  order  of  the 
polynomial,  the  number  of  experimental  data  points  available  in  the  local  region  inspected,  and  that 
the  difference  between  the  experimental  points  and  the  function  is  less  than  5%.  In  addition,  a  better 
optimization  scheme  was  found  which  makes  use  of  both  random  probe  and  gradient  methods.  This 
technique  is  referred  to  as  Simulated  Annealing  with  Iterative  Improvement  --SAWI  (Aly  et  al., 
1995),  and  is  based  on  the  simulated  annealing  (SA)  optimization  algorithm  first  introduced  by 
Kirkpatrick  et  al.  (1983).  SAWI  essentially  initiates  the  SA  process  to  locate  the  neighborhood  of 
the  global  optimum.  Prior  to  the  convergence  of  SA,  the  algorithm  switches  to  random  search 
iterative  improvement,  a  local  search  method,  to  converge  to  the  optimum.  The  Transition 
Parameter,  a  measure  of  where  SA  is  in  the  optimization  process,  is  used  to  terminate  SA  and  initiate 
iterative  improvement.  As  SA  is  relatively  inefficient  in  converging  to  a  final  solution  once  its 
neighborhood  has  been  identified,  the  use  of  a  local  search  method  significantly  reduces  the  overall 
computation  time  (Ogot  and  Alag,  1 993).  These  new  geometries  are  fabricated  and  tested  to  see  how 
well  they  agree  with  the  predicted  values  and  also  provide  additional  data  points  for  a  more  refined 
optimization  search  (i.e.  an  objective  function  with  more  terms).  This  procedure  can  then  be 
repeated  until  the  best  geometry  has  been  found. 

Experimental  Facility 

The  experiments  were  performed  in  the  variable  supersonic  wind  tunnel  located  in  the  Gas 
Dynamics  and  Laser  Diagnostic  Laboratory  at  Rutgers  University.  The  tunnel  operates  at  a 
stagnation  temperature  and  pressure  of  293  K  and  584  kPa  respectively.  The  nozzle  is  adjustable 
and  can  vary  the  Mach  number  continuously  from  1 .2  to  3.5,  for  the  present  study  the  Mach  number 
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was  set  to  3.0.  The  injector  pressure  was  regulated  to  a  pressure  of  412  and  390  kPa  for  the  two  and 
three  hole  injector  cases  resulting  in  a  jet-to-free  stream  momentum  flux  ratio  of  1.5  and  1.43, 
respectively.  The  test  section  has  cross  sectional  dimensions  of  102  mm  by  1 14  mm  with  glass 
windows  for  optical  access  on  three  sides. 

For  the  present  study,  a  removable  circular  insert  containing  the  injector  geometry  was  made 
to  fit  into  the  wind  tunnel  wall.  The  circular  insert  was  created  using  a  rapid  prototype  process 
termed  stereo-lithography.  The  stereo-lithography  system  (SLA  1 90/20)  is  controlled  using  the 
IDEAS  CAD/CAM  software  package.  A  solid  modeling  program  was  written  to  automatically  create 
the  circular  insert  with  the  multiple  injectors  with  inputs  of  the  x-y  positions,  diameters,  eccentricity 
(for  elliptic  geometries)  and  injection  angle  of  the  holes  of  the  injector.  Also  the  program 
automatically  incorporated  a  converging  transition  region  for  each  jet  as  well  as  the  groove  for  the 
o-ring  used  to  seal  the  circular  insert.  It  should  be  noted  that  the  jets  studied  in  this  investigation 
were  all  circular  and  perpendicular  to  the  flow.  The  total  time  to  create  a  set  of  6  injectors  is  about 
1 2  hours  due  to  the  curing  time  of  the  uv  sensitive  polymer.  It  could  be  argued  that  this  simple 
geometry  could  be  more  easily  manufactured  using  standard  machining  techniques,  but  it  was  desired 
to  create  a  more  generalized  infrastructure  for  more  complicated  models  to  be  studied  in  the  future. 

Figure  2  gives  a  schematic  of  the  tunnel,  laser,  and  camera  geometry.  The  frequency-doubled 
Nd:YAG  laser  used  for  this  experiment  was  a  Spectra  Physics  GCR-230  with  a  capability  of 
delivering  600  mJ  per  pulse.  The  pulse  duration  of  approximately  10  ns  effectively  freezes  the  image 
of  the  turbulent  motion  in  the  flow  field  resulting  in  instantaneous  measurements.  The  laser  beam 
was  formed  into  a  collimated  sheet  by  a  combination  of  cylindrical  and  spherical  optics.  First  the 
collimated  beam  (9  mm  in  diameter )  passes  through  a  cylindrical  lens  which  expands  the  light  in 
the  lateral  dimension  relative  to  the  tunnel.  This  is  followed  by  a  spherical  lens  which  reduces  the 
thickness  of  the  laser  sheet  providing  a  thin  light  sheet  passing  spanwise  through  the  wind  tunnel. 
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To  visualize  the  mixed  region,  the  jet  air  was  bubbled  through  a  seeding  chamber  filled  with  ethanol. 
Upon  mixing  with  the  cold  free  stream  the  ethanol  freezes  forming  small  particles  which  mark  the 
mixed  fluid.  This  technique  called  product  formation  has  been  used  to  study  the  mixing  of 
compressible  shear  layers  and  axisymmetric  jets  [Clemens  and  Mungal,  1992],  The  scattered  signal 
is  collected  using  a  Princeton  Instruments  14-bit  intensified  CCD  camera.  The  images  are  stored  on 
a  Pentium  100  MHZ  personal  computer  providing  camera  control,  laser  synchronization,  and  image 
processing.  All  the  images  used  in  the  mixing  analysis  described  below  were  oriented  in  the 
spanwise  direction  with  the  flow  oriented  out  of  the  page  having  a  spanwise  scale  of  68  mm  across 
the  image. 

Experimental  Procedure 

The  illustration  in  Figure  3  gives  a  schematic  of  the  experimental  procedure  used  to  optimize 
the  injector  hole  positions.  The  first  step  in  the  process  (Step  i)  is  to  select  random  points  in  the 
design  space  which  “fully”  represent  the  space  and  have  a  significant  variation  in  the  FOM.  Included 
in  the  initial  selection  of  injector  positions  is  the  base  line  case  which  is  a  single  jet  with  the  same 
exit  area  as  the  individual  jets  combined.  In  the  next  step  (Step  A)  the  circular  inserts  for  each  hole 
pattern  are  manufactured  using  stereo  lithography.  In  general  6  inserts  are  manufactured 
simultaneously.  Next  each  geometiy  is  run  in  the  wind  tunnel  (Step  B).  The  experiments  included 
recording  75  instantaneous  spanwise  images  at  three  downstream  locations  (x/D  =  5,  11,  17)  from 
the  centroid  of  the  hole  pattern.  For  image  processing  purposes  background  images  and  an  image 
of  a  dot  card  was  taken  for  each  case. 

Before  the  mixing  quantities  are  calculated  from  the  spanwise  images  they  are  corrected  for 
intensity  and  geometric  distortion  (Step  C).  To  correct  for  intensity,  a  75  instantaneous  background 
images  without  the  signal  from  the  flow  were  taken  and  averaged  together.  This  average  background 
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image  was  subtracted  from  each  image  and  the  images  were  normalized  by  the  average  maximum 
value.  This  insured  that  images  from  day  to  day  and  at  different  streamwise  locations  had  the  same 
dynamic  range.  Since  it  was  desired  to  measure  these  quantities  at  different  locations  downstream, 
the  camera  and  laser  sheet  had  to  be  frequently  moved  producing  a  geometric  bias  in  the  size  and 
location  of  the  images  as  well  as  bias  due  to  the  oblique  viewing  angle  of  the  camera.  To  correct  for 
these  geometric  effects,  an  image  of  a  grid  of  dots  was  taken  each  time  the  camera’s  position 
changed.  An  image  processing  program  was  developed  which  located  the  dots  on  the  dot  card  and 
uses  this  information  to  realign  the  images  correcting  for  geometric  distortions  due  to  image  size 
changes  and  viewing  at  an  oblique  angle. 

From  the  experimentally  obtained  spanwise  images  three  different  figures  of  merit  were 
calculated  with  a  forth  parameter  calculated  by  combining  all  three  together  and  weighting  them 
equally  (Step  D).  Figure  4  illustrates  two  of  these  parameters  the  mixing  area  (Ap)  and  jet 
penetration  depth  (yp).  The  mixing  area  represents  the  amount  of  the  jet  that  has  been  entrained  into 
the  freestream.  The  mixing  area  for  each  instantaneous  image  was  calculated  by  summing  the 
number  of  pixel  which  are  above  5%  of  the  average  maximum  value.  These  values  are  then  averaged 
and  converted  to  square  mm.  The  repeatability  of  the  mixing  area  to  day  to  day  variations  and  tunnel 
conditions  variation  was  found  to  be  less  than  2%.  The  jet  penetration  represents  the  ability  of  the 
jet  to  intrude  into  the  free  stream  away  from  the  surface.  The  penetration  distance  is  found  by  taking 
the  centroid  of  the  mixing  area  images  and  calculating  the  distance  from  the  centroid  to  the  surface. 
Another  parameter  that  represents  the  ability  of  the  transverse  jets  to  mix  with  the  freestream  is  the 
growthrate.  The  growthrate  represents  how  quickly  the  jet  is  entrained  into  the  free  stream  as  the 
jet  develops  downstream.  From  the  images  a  growthrate  (dAp/dx)  can  be  calculated  from  the  slope 
of  a  linear  curve  fit  through  the  three  average  mixing  areas  calculated  at  each  streamwise  location. 
It  should  be  noted  that  there  are  many  other  definitions  which  could  be  used  to  define  the  “best” 
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mixing  and  it  may  depend  on  characteristics  specific  to  the  application.  Also  any  one  of  these  three 
definitions  may  be  best  suited,  therefore  the  hole  pattern  was  optimized  for  each  individual  mixing 
definition  (i.e.  mixing  area,  penetration,  and  growthrate).  It  could  be  argued,  however,  that  all  these 
mixing  definitions  are  important  and  so  a  combined  figure  of  merit  is  calculated  representing  a  forth 
mixing  definition.  The  combined  figure  of  merit  (CFOM)  is  given  by 


CFOM  = 


+  IAP1  dx]2 

[Ap0ldx]2 


(3) 


where  each  mixing  definition  is  normalized  by  its  value  for  the  baseline  single  circular  jet  case 
denoted  by  the  subscript  0.  It  should  be  noted  that  individual  mixing  definitions  were  weighted 
equally  in  the  present  study,  but  it  may  be  desirable  in  the  future  to  consider  one  or  more  definitions 
more  important.  Also  the  CFOM  is  normalized  to  unity  for  the  best  case. 

After  the  data  is  taken  from  each  manufactured  jet  geometry  and  the  four  mixing  definitions 
are  evaluated  the  SAWI  optimization  algorithm  is  initiated  (Step  E).  Within  the  SAWI  algorithm 
described  above  the  large  design  space  is  reduced  around  promising  regions  and  a  least  square 
polynomial  is  fit  to  the  data  in  this  region.  Note  again  that  the  polynomial  was  made  to  be  the  lowest 
order  possible  to  describe  the  “optimal  region”  (instead  of  the  entire  data  set)  as  defined  by  the 
residue  between  the  function  and  experimental  data  (in  general  the  residue  was  less  than  5%  in  these 
regions).  Candidate  designs  (i.e.  jet  orientations)  for  each  FOM  to  be  manufactured  were  selected 
(Step  F)  if  they  were  significantly  different  (greater  than  2%)  than  designs  already  evaluated  as  well 
as  higher  than  the  baseline  case.  Therefore  new  designs  were  evaluated  which  may  be  less  than  the 
current  optimum,  but  enable  the  design  space  to  be  checked  in  promising  regions.  The  new  designs 
were  then  manufactured  (Step  A)  and  the  process  was  repeated.  Termination  of  the  optimization 
(Step  G)  was  decided  when  no  significantly  different  points  were  suggested  by  the  SAWI  algorithm 


8-11 


and  when  the  predicted  values  and  actual  values  of  the  FOM  were  within  5%.  Following  is  the 
results  of  using  this  experimental  methodology  to  optimize  two  and  three  injector  geometries  for 
mixing. 

Two  Injector  Optimization 

In  order  to  test  the  experimental  methodology  and  optimization  algorithm  the  first  case 
studied  was  a  two  injector  geometry.  This  allowed  the  design  space  to  be  visualized  since  the 
number  of  design  parameters  with  constraints  is  reduced  to  two  with  the  FOM  evaluated  (i.e.  mixing 
area,  penetration,  or  the  combined  figure  of  merit)  as  the  third  dimension  of  the  surface.  Also,  it  is 
much  easier  to  understand  the  physics  of  the  mixing  since  there  is  less  interaction  than  the  three  jet 
geometry.  For  the  two  injector  “proof  of  concept”  experiments  the  growthrate  was  eliminated  from 
the  analysis  so  that  the  number  of  tunnel  runs  could  be  reduced  with  only  one  streamwise  location 
needed  to  calculate  the  mixing  area  and  penetration  depth.  Images  were  taken  of  the  single  injector 
baseline  case  so  that  the  mixing  definitions  (i.e.  figure  of  merit,  FOM)  could  be  calculated.  The 
diameter  of  the  single  injector  case  had  the  same  total  exit  area  as  the  two  jet  geometries  resulting 
in  a  jet  diameter  of  3.46  mm  for  the  single  jet  and  2.54  mm  for  the  two  jets. 

Four  variables  are  needed  to  describe  the  locations  of  the  two  holes  on  a  Cartesian  plane  (i.e. 
x„  z„  x2,  Z2)  as  shown  in  Figure  5.  The  problem  is  constrained  in  that  the  holes  must  be  located  on 
the  injector  disk  with  a  radius  (R^k)  of  12.2  mm.  The  optimization  problem  then  can  be  represented 
as  follows: 
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maximize  f(x) 


Subject  to: 


2  ^  2  n  2 
x,  +z,  <R 


2  2 

X,  +z ,  <R 


disk 
2  disk 


X,  +x2=0 

Zl+Z2=0 


-a0+alx]  +  a2zl  +  a3x2+a4  z2 
ai  x\+  as  z\  +  a9  x22+  a,0  z22+... 


where  f(x)  is  the  figure  of  merit  to  be  optimized  and  ^  the  coefficients  of  the  polynomial.  The  x  and 
z  centroid  of  the  hole  pattern  was  constrained  to  be  at  the  center  of  the  disk.  The  z  centroid  was 
constrained  to  the  center  to  prevent  physical  redundancy  and  the  x  centroid  to  allow  for  comparison 
between  cases.  In  addition,  since  interchanging  x  with  -x  or  z  with  -z  produces  the  same  physical 
geometry,  only  even  terms  are  needed  to  fit  the  data.  This  allows  for  odd  terms  to  be  removed  from 
the  function  and  higher  order  curve  fits  can  be  used.  These  equalities  reduce  the  problem  to  that  of 
two  design  variables. 

Figure  6  illustrates  the  progression  of  the  mixing  definitions  of  mixing  area,  penetration,  and 
combined  figure  of  merit  (CFOM)  for  each  iteration  step  including  the  single  jet  baseline  case  at 
iteration  number  zero.  Initially,  eight  geometries  were  chosen  which  would  span  the  domain 
appropriately  with  the  SAWI  optimization  routine  suggesting  at  least  one  new  geometry  for  each 
FOM  at  successive  iterations.  The  value  of  the  FOM  for  all  geometries  are  shown  at  each  iteration 
even  though  some  are  used  to  optimize  another  FOM.  In  all  cases  the  best  figure  of  merit  was  found 
at  the  last  iteration  although  at  the  intermediate  iteration  (iteration  #2)  the  figure  of  merit  dipped 
down  slightly.  This  is  due  to  the  SAWI  algorithm  and  the  fact  that  most  optimum  values  are  only 
evaluated  if  they  are  significantly  different  than  geometries  already  evaluated.  Table  1  gives  the 
baseline  (single  injector)  and  optimized  figure  of  merits  for  mixing  area,  penetration,  and  CFOM. 
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In  all  mixing  definitions  the  FOM  was  improved  (82%  for  mixing  area  and  43%  for  the  CFOM),  but 
penetration  had  the  least  improvement  of  only  7.3%. 

Figures  7  and  8  give  the  physical  injector  layout,  average  image,  and  one  instantaneous  image 
for  the  single  jet  baseline  case,  best  mixing  area  geometry,  penetration  geometry,  and  CFOM 
geometry.  The  single  jet  case  in  Figures  7a  and  7b  show  typical  transverse  jet  injection 
characteristics  with  counter-rotating  vortex  pair  clearly  visible  in  the  average  and  instantaneous 
images.  The  injector  geometry  which  optimizes  the  best  mixing  area  (Figure  7c  and  d)  arranges  the 
injectors  in  a  line  perpendicular  to  the  flow.  This  is  physically  reasonable  since  holes  spaced  far 
apart  would  behave  more  like  two  independent  jets  producing  greater  total  mixing  area  than  when 
the  jets  interfere  greatly.  It  should  be  noted  however  that  the  spacing  was  not  to  the  maximum  edge 
of  the  design  domain  suggesting  there  is  still  some  interaction.  The  geometry  and  resulting  images 
giving  the  optimized  penetration  depth  is  shown  in  Figure  8a,b.  This  geometry  has  the  injectors 
lined  up  with  the  flow  direction  and  wide  apart,  but  again  not  to  the  maximum  edges  of  the  design 
domain  which  seems  reasonable  since  it  would  be  expected  that  the  wake  and  counter-rotating  for 
the  first  jet  would  enhance  the  penetration  depth  of  the  second  jet.  The  average  and  instantaneous 
images  show  a  similar  flow  pattern  to  the  single  jet  case  with  the  counter  rotating  vortices  clearly 
observed.  Figure  8c, d  show  the  geometry  for  the  CFOM  and  resulting  average  and  instantaneous 
image.  This  geometry  is  the  same  as  the  mixing  area  geometry  for  reasons  which  will  be  discussed 
shortly. 

Since  there  are  only  two  design  variables  for  the  two  injector  geometry,  a  surface  plot  can 
be  made  for  each  mixing  definition  with  the  FOM  used  for  the  third  dimension.  Figure  9  gives  the 
surface  plot  for  mixing  area,  penetration,  and  CFOM.  A  sixth  order  polynomial  was  found  to  fit  the 
data  sets  with  less  than  5%  standard  deviation  from  the  function  and  actual  data.  Upon  performing 
the  optimization  on  the  individual  area  function,  the  results  gave  optimized  geometries  primarily 
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with  the  holes  in  a  line  perpendicular  to  the  flow.  This  can  be  seen  graphically  in  Figure  9a  where 
the  objective  function  grows  as  the  outside  edges  of  the  disk  are  reached  (z=±12.2  mm).  Similarly, 
when  penetration  depth  was  the  individual  objective  function,  the  results  indicated  that  the  best 
geometry  was  for  injectors  lined  up  along  the  flow  direction  as  can  be  seen  in  Figure  9b  where  two 
distinct  peaks  occur  for  zf=0  and  x=±12.2  mm.  As  mentioned  previously  these  geometries  allow  the 
jets  to  interfere  constructively  and  reinforce  each  other  to  penetrate  the  free  stream.  As  seen  in 
Figure  9c  the  CFOM  optimization  gave  similar  results  and  function  shape  to  the  area  optimization. 
This  is  due  to  the  fact  that  the  area  changes  by  up  to  82%  while  the  penetration  changes  by  at  most 
7.3%.  Since  it  was  assumed  that  these  two  objectives  were  equally  important  in  defining  mixing, 
the  figure  of  merit  calculation  weighs  these  two  objectives  equally.  Therefore,  the  changes  in  area 
outweigh  the  changes  in  penetration  and  dictate  the  CFOM. 

Three  Injector  Geometry 

Now  that  the  optimization  algorithm  has  been  shown  to  give  reasonable  results  for  the  two 
injector  geometry,  the  complexity  of  the  geometry  and  analysis  was  increased.  The  second 
optimization  experiment  involved  finding  the  location  of  three  injectors.  Also  the  growthrate  was 
incorporated  as  an  additional  FOM  evaluated  individually  and  added  into  the  CFOM.  This  geometry 
proved  to  be  more  interesting  since  the  effects  of  jet  interactions  were  enhanced.  Here,  the 
geometries  were  no  longer  symmetric  with  respect  to  the  z-axis  (ie.  x  can  not  be  interchanged  with 
-x)  so  odd  terms  in  x  were  required.  Using  similar  constraints  as  in  the  two-hole  case  and  forcing 
one  of  the  holes  to  lie  along  the  x-axis  ( z=0  ),  the  problem  could  be  reduced  to  three  variables.  This 
still  presents  the  problem  of  plotting  and  viewing  the  data  since  three  variables  are  needed  to 
represent  the  geometries  and  a  fourth  to  represent  the  objective  function  value. 

Six  variables  were  needed  to  describe  the  locations  of  the  three  holes  on  a  Cartesian  plane 
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(i.e.  x„  z„  x2,  z2,  x3,  z3)  as  shown  in  Figure  10.  The  diameter  of  the  single  injector  case  had  the  same 
total  exit  area  as  the  three  jet  geometries  resulting  in  a  jet  diameter  of  6.35mm  for  the  single  jet  and 
3.67mm  for  the  three  jets.  The  problem  is  constrained  in  that  the  holes  must  be  located  on  the 
injector  disk  with  a  radius  (Rdjsk)  of  10.6  mm  and  that  the  x  and  z  values  must  be  in  ascending  order. 
This  last  constraint  prevents  redundancy  in  the  configurations  since  interchanging  holes  would 
produce  the  same  result.  The  optimization  problem  then  can  be  represented  as  follows: 

maximize  f(x:)=a0  +  a,  x{  +  a2z2  +  a3  x2+  aA  z2  +  a$x3  +  aez3  + 

ar A +  °8  A  +  a9  A  +axo  A  +  °uA  +  a!2  A  +~ 

Subject  to: 

x}<x2<x3  OR  zx<z2<z3 

A+A±rL 

A+A<  R2disk 

7  2  2 

X3  +Z3  *  Rdisk 

*l+*2+X3=0 

y  \+z2+z3~® 


To  simplify  this  procedure,  one  hole  is  constrained  to  be  on  the  centerline  (z^O),  making  one 
of  the  z  values  zero.  Also,  the  x  and  z  centroid  of  the  holes  were  constrained  to  be  at  the  center  of 
the  disk.  The  z  centroid  was  constrained  to  the  center  to  prevent  physical  redundancy  and  the  x 
centroid  to  allow  for  comparison  between  cases.  These  three  equalities  reduce  the  problem  to  that 
of  three  design  parameters.  Also,  an  addition  constraint  was  chosen  based  on  the  difference  of  new 
geometries  from  those  already  evaluated.  If  the  difference  of  the  new  geometries  was  less  than  2% 
other  geometries  were  chosen.  This  means  that  the  last  iteration  may  not  show  improved  values,  but 
the  diversity  of  the  geometries  evaluated  is  always  increased. 

Table  2  gives  the  baseline  (single  injector)  and  optimized  figure  of  merits  for  mixing  area, 
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penetration  depth,  growthrate,  and  the  CFOM.  In  all  mixing  definitions  the  figure  of  merit  was 
improved  (66%  for  mixing  area,  12%  for  growthrate,  and  9%  for  the  CFOM),  but  again  penetration 
had  the  least  improvement  of  only  2%.  Figures  1 1  and  12  give  the  physical  injector  layout,  average 
image,  and  one  instantaneous  image  for  the  single  jet  baseline  case,  best  mixing  area  geometry, 
penetration  geometry,  and  CFOM  geometry.  The  single  jet  case  in  Figure  1  la  and  1  lb  again  show 
a  typical  transverse  jet  injection  characteristics  as  described  previously.  The  injector  geometry  which 
optimizes  the  best  mixing  area  (Figures  1 1  c,d)  arranges  the  injectors  in  a  line  spread  out,  but  not 
totally  perpendicular  to  the  flow  like  the  two  hole  geometry.  This  indicates  that  the  interaction 
between  the  jets  is  much  more  important  for  the  three  injector  geometry.  The  geometry  and  resulting 
images  giving  the  optimized  penetration  depth  is  shown  in  Figures  12a,b.  This  geometry,  similar 
to  the  two  hole  geometry,  has  the  injectors  lined  up  with  the  flow  direction  with  a  wide  separation. 
Again  this  geometry  seems  reasonable  since  the  wake  and  counter-rotating  for  the  first  jet  would 
enhance  the  penetration  depth  of  the  second  jet.  The  average  and  instantaneous  images  how  the 
existence  of  counter  rotating  vortices  clearly  observed.  Figures  12c,d  show  the  geometry  for  the  best 
growth  rate  and  resulting  average  and  instantaneous  image.  The  three  injectors  in  this  geometry  is 
aligned  with  the  flow  to  maximize  the  interaction  between  the  jets.  It  is  also  interesting  to  note  that 
the  average  and  instantaneous  images  clearly  show  the  counter-rotating  vortex  structure  which  may 
be  essential  to  maximize  the  growthrate.  The  growth  rates  over  the  domain  had  the  most  variation 
of  the  FOM’s  evaluated  ranging  from  0  to  1.25  mm2/mm.  Physically,  this  would  indicate  that 
injectors  separated  by  some  finite  distance  with  pronounced  interference  effects  provide  the  optimum 
growthrate.  The  CFOM  has  the  same  geometry  as  the  growthrate  due  to  the  fact  that  the  growthrate 
has  much  higher  variation  between  geometries  and  therefore  has  the  greatest  effect  since  the  FOM 
are  equally  weighted. 

The  main  difficulty  involved  with  the  three  hole  case  was  getting  a  physical  picture  of  the 
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objective  function  due  to  the  presence  of  the  third  independent  variable,  thus  becoming  a  four 
dimensional  problem.  As  a  result  interpretation  for  the  design  domain  is  much  more  difficult. 

Conclusions 

Experiments  were  performed  on  transverse  injection  into  a  supersonic  flow  using 
optimization  techniques.  The  first  experiment  involved  finding  the  best  position  for  two  equal 
diameter  injectors  which  gave  the  best  cross-sectional  area  and  penetration  depth,  both  individually 
and  combined.  It  was  found  that  changes  in  area  outweighed  changes  in  penetration  depth  and  was 
the  greatest  when  the  two  injectors  were  placed  far  apart  in  a  line  perpendicular  to  the  flow.  The 
second  experiment  involved  finding  the  best  position  for  three  equal  diameter  holes  with  the  added 
objective  of  optimum  growth  rate.  Although  results  could  not  be  as  easily  interpreted  as  in  the  two 
injector  case,  they  suggest  that  optimum  solutions  combining  these  three  parameters  exist  for 
geometries  with  injectors  at  intermediate  distances  from  the  center  and  give  improved  mixing  as 
compared  to  an  equivalent  single  hole.  Further  research  beyond  the  scope  of  this  preliminary 
investigation  will  involve  studying  other  optimization  routines  applied  to  experimental  analysis, 
consider  other  mixing  definitions  experiments  which  would  explain  the  improvements  in  mixing, 
and  more  complicated  geometries  better  utilizing  the  benefits  of  stereo-lithography.  Aspects  of  this 
research  program  has  been  accepted  for  presentation  in  ASME  and  AIAA  fluids  conferences. 
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Table  1.  Comparison  of  the  optimized  two  injector  geometries  with  the  baseline  case 


Model 

Ip 

IB 

fgpg 

luS&n 

m 

|g| 

mixing 
area  [mm2l 

penetration 

fmml 

CFOM 

%  increase 
of  FOM 

Single  injector 

0 

0 

0 

0 

322 

9.82 

0.698 

♦ 

Optimized  for  mixing  area 

11.76 

0 

-11.76 

0 

588 

8.62 

1 

82% 

Optimized  for  penetration 

*0 

9.31 

0 

-9.31 

457 

10.6 

0.88 

7.3% 

Optimized  for  CFOM 

11.76 

0 

-11.76 

0 

588 

8.62 

1 

43% 

Table  2.  Comparison  of  the  optimized  three  injector  geometries  with  the  baseline  case 


Model 

(-)zl 

[mm] 

(-)xl 

[mm] 

(-)zl 

[mm] 

(-)x2 

[mm] 

(-)z3 

[mm] 

(-)x3 

[mm] 

Mixing 

area 

[mm2] 

penetration 

[mm] 

growth  rate 
[mm2/mm] 

CFOM 

%  incr. 
of  FOM 

0 

0 

0 

0 

0 

0 

469 

15.3 

0.0443 

0.949 

* 

Optimized  for 
mixing  area 

-9.61 

-4.73 

0 

-0.58 

9.61 

5.32 

778 

9.48 

0 

0.982 

66% 

Optimized  for 
penetration 

0 

-11 

0 

0 

0 

11 

566 

15.6 

0.0161 

0.887 

2% 

Optimized  for 
growthrate 

0 

-4.89 

0 

10.9 

0 

-6.11 

511 

14.6 

0.0495 

1 

12% 

Optimized  for 
CFOM 

0 

-4.89 

0 

10.9 

0 

-6.11 

511 

14.6 

0.0495 

■ 

9% 
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Counter-Rotating 
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Figure  1 .  Perspective  view  of  the  mean  transverse  injection  flowfield. 


Mirror 


Nd:YAG  Laser 


Window 


ICCD  Camera 


Figure  2.  Plan  view  of  the  tunnel  and  optical  arrangement  for  measurements  of  a  iet  in  a 
supersomc  cross  flow.  J 


Figure  3.  Schematic  of  optimization  procedure. 


Figure  4.  Schematic  of  mixing  area  (Ap)  and  penetration  (yp)  definitions. 
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injector  geometry. 
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Appendix  A 


Following  is  the  complete  data  of  all  the  experiments  performed  for  two  injector  and  three 
injector  geometries.  The  injector  positions  are  given  in  coordinates  which  are  shown  in  Figures  5 
and  10. 


Table  3.  Two  Injector  Optimization 


(-)zl 

[mml 

(-)xl 

_ tmm] 

!l 

(-)x2 

[mm] 

mixing  area 
fmm2l 

penetration 

[mml 

CFOM 

0 

0 

0 

0 

321.9348 

9.8209 

0.6975 

0.50715 

1.89385 

-0.50715 

-1.89385 

356.1283 

9.1281 

0.7126 

1.89385 

0.50715 

-1.89385 

-0.50715 

345.8058 

8.4765 

0.6796 

1.77625 

6.62725 

-1.77625 

-6.62725 

403.8702 

9.525 

0.782 

3.11885 

3.11885 

-3.11885 

-3.11885 

492.9022 

8.529 

0.8681 

6.62725 

1.77625 

-6.62725 

-1.77625 

517.4183 

8.0311 

0.8893 

6.58315 

6.58315 

-6.58315 

-6.58315 

500.6442 

8.0996 

0.8681 

3.0429 

11.3582 

-3.0429 

-11.3582 

495.4829 

9.4375 

0.8948 

1  1.3582 

3.0429 

-11.3582 

-3.0429 

587.7408 

8.5087 

0.9966 

5.4047 

4.2238 

-5.4047 

-4.2238 

460.9462 

8.1305 

0.8157 

10.2851 

5.70115 

-10.2851 

-5.70115 

554.8518 

7.8801 

.  0.9376 

5.6203 

3.9347 

-5.6203 

-3.9347 

436.3488 

7.7414 

0.7733 

7.62685 

5.341 

-7.62685 

-5.341 

537.7105 

8.0662 

0.9179 

11.76 

0 

-11.76 

0 

588.3859 

8.6243 

1 

0 

9.31 

0 

-9.31 

457.4184 

10.5946 

0.8798 

6.5758 

9.74855 

-6.5758 

-9.74855 

503.87 

8.2798 

0.8767 
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Table  4.  Two  Injector  Optimization 


(-)zl 

[mm] 

(-)xl 

[mm] 

(-)zl 

[mm] 

(-)x2 

[mm] 

(-)z3 

[mm] 

(-)x3 

[mm] 

Mixing 

area 

rmm2l 

penetration 

[mm] 

growth  rate 
[mm2/mm] 

CFOM 

0 

0 

0 

0 

0 

0 

469.982 

15.3197 

0.0443 

0.9488 

-9.525 

5.4991 

0 

-10.998 

9.525 

5.4991 

593.009 

11.5623 

0.0152 

0.8271 

-4.7625 

-2.7495 

0 

5.4991 

4.7625 

-2.7495 

495.401 

14.4107 

0.0222 

0.821 

-4.7625 

2.7495 

0 

-5.4991 

4.7625 

2.7495 

494.095 

12.4507 

0.0265 

0.7982 

-10.998 

0 

0 

0 

10.9982 

0 

548.167 

11.747 

0.0162 

0.7903 

0 

-10.998 

0 

0 

0 

10.9982 

566.495 

15.5937 

0.0161 

0.8868 

-3.9992 

0 

0 

0 

3.9992 

0 

493.573 

14.0191 

0.0168 

0.7906 

0 

-3.9992 

0 

0 

0 

3.9992 

471.342 

13.5852 

0.0264 

0.8027 

0 

-5.4991 

0 

-5.4991 

0 

10.9982 

655.215 

13.8217 

0.0181 

0.9368 

-9.6088 

5.3492 

0 

-1.8974 

9.6088 

-3.4519 

644.171 

9.9507 

0.0105 

0.8409 

0 

-4.887 

0 

10.9982 

0 

-6.1112 

510.895 

14.5661 

0.0495 

1 

-8.824 

-3.4595 

0 

-3.1064 

8.824 

6.5659 

672.593 

10.5518 

0.0343 

0.9675 

-4.9708 

-9.812 

0 

0 

4.9708 

9.812 

584.406 

13.6923 

0.0383 

0.9634 

-2.0091 

-8.0086 

0 

-2.8042 

2.0091 

10.8128 

570.999 

14.04 

0.0131 

0.8492 

-2.032 

-2.8321 

0 

-2.8321 

2.032 

5.6642 

578.115 

11.6169 

0.0169 

0.8186 

-4.638 

-0.414 

0 

-5.8572 

4.638 

6.2713 

667.865 

9.8919 

0.0057 

0.8579 

-6.162 

-9.05 

0 

9.8908 

6.162 

-0.8407 

684.751 

11.6782 

0 

0.9049 

-9.6088 

-4.7371 

0 

-0.5842 

9.6088 

5.3213 

778.068 

9.4794 

0 

0.9818 

-0.5715 

-10.49 

0 

7.6048 

0.5715 

■2.8854 

614.181 

13.2856 

0 

0.8662 

Appendix  B 

Although  not  part  of  the  optimization  procedure  another  geometry  that  was  tested  was  the 
effect  of  crossing  oblique  shock  waves  on  the  mixing  of  a  transverse  jet  injected  in  a  supersonic  free 
stream.  This  geometry  is  illustrated  in  Figure  B I .  The  crossing  shocks  were  believed  to  be  possible 
mixing  enhancement  mechanisms  due  to  the  work  conducted  with  side  wall  shock  enhancement  in 
planar  mixing  layers  and  the  existence  of  counter-rotating  vortices  in  the  boundary  layer  in  these 
flowfields.  The  flow  conditions,  seeding,  mixing  definitions,  and  optical  arrangement  were  the  same 
as  the  single  injector  geometry  described  previously  for  the  three  injector  optimization.  The  start  of 
the  shocks  was  approximately  132  mm  before  the  center  of  the  circular  jet  and  the  two  wedges  were 
inclined  at  7°  relative  to  the  free  stream.  Thus  the  crossing  shocks  were  formed  before  the  injector 
location. 

Table  B 1  gives  the  mixing  area,  penetration  depth,  and  CFOM  of  merit  for  the  single  jet  with 
and  without  the  crossing  shocks.  As  seen  here  the  crossing  shocks  tend  to  suppress  the  mixing  for 
all  definitions  instead  of  increasing  them.  The  reason  for  this  is  evident  when  observing  the  average 
and  instantaneous  images  given  in  Figure  B2.  The  counter  rotating  vortex  tends  to  keep  the  jet  from 
injecting  into  the  free  stream  suppressing  the  jet  into  the  boundary  layer. 

Due  to  the  fact  that  initial  experiments  showed  no  level  of  improvement  and  from  discussions 
with  the  project  monitor  few  experiments  were  conducted  on  this  flow  making  the  emphasis  of  the 
research  on  the  multiple  injector  optimization.  It  should  be  kept  in  mind,  however,  that  only  one  jet 
position  relative  to  the  crossing  shocks  was  conducted  and  generalizing  these  findings  to  all 
geometries  is  not  suggested  without  a  more  detailed  investigation. 
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Table  B 1 .  Effect  of  crossing  shocks  on  the  mixing  characteristics  of  transverse  jet  injection 


Model 

Mixing  Area 
[mm2] 

Penetration 

[mm] 

CFOM 

Without  Crossing  Shocks 

469.98 

15.32 

1.414 

With  Crossing  Shocks 

351.60 

14.87 

1.226 
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Fin 


Figure  B 1 .  Schematic  of  a  transverse  jet  in  a  supersonic  cross  flow  with  crossing-shock 
interaction. 
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Figure  B2.  Span  wise  view  images  for  the  averaged  and  instantaneous  single  injector  without 
crossing  shocks  (a,b)  and  (c.d),  and  averaged  and  instantaneous  single  injector  with  crossing 
shocks. 
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DEVELOPMENT  OF  MULTIPLE  METAL-DIELECTRIC  LAYERS  FOR  3-D  MMIC 

Altan  M.  Ferendeci 

Department  of  Electrical  and  Computer  Engineering 
and  Computer  Science 
University  of  Cincinnati 

INTRODUCTION 

At  present,  solid  state  power  sources  can  not  replace  the  high  power  slow-wave  microwave 
tubes.  But  by  incorporating  novel  power  combining  techniques,  the  overall  combined  power  of  the 
solid  state  sources  may  approach  that  of  a  single  tube  output.  This  is  especially  true  for  phased 
array  antenna  systems  where  each  radiator  has  its  own  power  amplifier  and  the  total  radiated  power 
is  the  combined  power  of  all  the  radiators.  In  a  phased  array  antenna  system,  the  transmit/receive 
(T/R)  modules  are  placed  on  a  line  (linear  array),  on  a  plane  (planar  array)  or  on  a  circular  area 
(circular  array).  In  all  cases,  the  separation  distance  between  the  antennas  should  be  less  than  half 
the  free  space  wavelength  of  the  radiated  signal  to  eliminate  the  main  side  lobes. 

At  microwave  and  millimeter  wave  frequencies,  planar  antennas  are  replacing  the 
conventional  antenna  structures.  The  dimensions  of  the  patch  antennas  are  generally  less  than  the 
required  free  space  half-wavelength.  They  can  be  configured  to  be  used  in  any  array  configuration. 
Unfortunately,  if  a  compact  module  including  all  the  electronics  is  to  be  integrated  with  the  patch 
antenna,  it  is  impossible  to  place  all  the  individual  circuit  modules  on  the  same  substrate  and  at  the 
same  time  satisfy  the  antenna  spacing  requirements.  This  becomes  more  important  especially  at 
millimeter  wavelengths.  Even  though  most  of  the  system  components  are  in  the  form  of  monolithic 
integrated  circuits,  they  still  occupy  a  large  surface  area  of  a  substrate.  Even  if  the  complete  circuit 
can  be  laid  out  on  the  same  GaAs  or  InP  substrate,  it  will  be  too  expensive  to  use  this  type  of  large 
area  substrate  behind  each  of  the  radiator.  Even  if  lumped  elements  are  replaced  by  distributed 
elements  at  higher  frequencies,  the  overall  T/R  module  may  still  require  a  large  substrate  area.  The 
complete  transmit/receive  (T/R)  module  also  contains  a  high  power  amplifier.  Unless  precautions 
are  taken  to  remove  the  generated  heat,  placing  this  circuit  on  the  same  substrate  with  other 
temperature  sensitive  circuits  such  as  low  noise  preamplifiers  may  not  be  desirable. 

At  present,  all  circuit  modules  are  placed  in  an  enclosed  casing.  The  power  is  coupled  to  the 
antenna  through  one  end  of  the  casing.  Unfortunately,  this  increases  the  overall  depth  of  the  T/R 
module  relative  to  the  lateral  dimensions  of  the  antenna.  These  modular  units  have  the  distinct 
disadvantage  of  having  a  surface  area  to  depth  ratio  very  small.  Even  with  these  limitations,  they 
can  still  be  utilized  in  a  linear  or  a  planar  array.  They  can  not  be  distributed  over  a  general  curved 
surface.  For  a  module  to  be  universally  confomral  to  any  surface,  the  depth  of  the  module  should 
be  negligibly  small  compared  to  the  maximum  lateral  dimensions  of  the  modular  unit. 
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It  is  possible  to  place  the  circuit  modules  vertically  one  over  the  other  so  that  the  final  T/R 
module  will  occupy  less  space.  3D  module  configuration  dictates  the  development  of  a  new 
packaging  technology  so  that  various  circuit  modules  at  various  levels  can  be  vertically 
interconnected  with  each  other  with  little  or  no  loss  in  signal  amplitude,  with  no  cross  coupling 
between  the  elements  and  with  manageable  processing  steps. 

There  is  already  considerable  research  and  development  effort  being  extended  to 
incorporate  vertical  interconnects  in  the  implementation  of  monolithic  microwave  integrated  circuits 
(MMIC)[1,2].  There  are  two  major  concentrated  research  areas  that  utilize  3D  vertical 
interconnects.  First  area  is  the  use  of  vertical  interconnects  for  compacting  the  size  of  an  integrated 

circuit  module  [3].  In  this  scheme,  thin  multiple  dielectric  layers  (2.5  pm)  are  used  for  connecting 

the  various  elements  of  the  circuit  together.  The  second  major  concentrated  effort  is  to  stack 
various  circuit  modules  of  the  overall  T/R  module  one  over  the  other  using  vertical  interconnects. 
Actually  both  of  these  concepts  complement  each  other  since  the  final  T/R  module  may  contain 
various  vertically  interconnected  MMICs. 

In  either  case,  coupling  of  microwave  or  millimeter  wave  signals  between  the  layers 
become  a  major  area  of  research.  Already,  there  is  considerable  research  work  going  on  related  to 
the  coupling  mechanisms  between  transmission  lines  of  the  various  layers  [4].  Novel  transmission 
lines  are  also  proposed  and  analyzed  [5-9]. 

In  this  proposal,  development  of  a  new  technology  to  implement  vertically  interconnected 
3D  T/R  module  for  a  conformal  phased  array  antenna  system  is  proposed.  Details  of  the  proposed 
development  based  on  deposition  of  multiple  dielectric  and  conductor  layers  including  an  integral 
patch  antenna  will  be  given.  The  extensions  necessary  for  major  components  of  the  T/R  module 
and  the  specific  areas  that  have  to  be  developed  will  be  discussed. 

PHASED  ARRAY  ANTENNA  MODULE 

Typical  front  end  of  a  T/R  module  for  a  radar  system  is  shown  Figure  1 .  Figure  2  shows  a 
2x2  modular  phased  array  antenna  system.  The  control  and  processing  electronics  are  not  shown  in 
these  figures.  The  lowest  substrate  layer  contains  the  power  amplifier.  Heat  can  be  easily 
transferred  away  from  the  substrate  by  placing  the  overall  module  on  a  highly  heat  conducting 
metallic  surface. 

Various  layers  of  the  T/R  module  are  processed  by  deposition  of  multiple  dielectric 
materials  of  known  thickness  followed  by  metal  depositions  immersed  between  the  dielectric 
layers.  As  the  module  is  processed,  necessary  interconnecting  posts  (or  vias)  are  electro-deposited 
and  the  circuit  for  that  layer  is  finally  processed. 
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The  upper  layer  of  the  module  has  a  patch  antenna  which  could  be  either  rectangular, 
circular  or  ring  in  shape  [10].  The  thickness  of  the  upper  dielectric  layer  below  the  patch  can  be 
adjusted  to  provide  the  maximum  radiation  condition  for  a  specific  antenna  [1 1], 


Figure  1.  Major  components  of  a  T/R  module. 


Typical  dimensions  for  patch  antennas  operating  at  10  GHz  and  with  a  dielectric  material  of 

e=2.5  are  given  in  Table  I.  The  overall  dimensions  of  these  patches  are  smaller  than  fJ2=15  mm 

and  thus  can  be  used  in  phased  array  antenna  systems  since  they  easily  satisfy  the  phased  array 
antenna  spacing  requirements. 


Table  I.  Dimensions  of  a  patch  antennas 
( f=10  GHz,  Xj2  =  15  mm,  £=2.5  ) 


Patch 

Dimensions  (mm) 

Directivity  (dB) 

Rectangular 

w=ll  J,  L=10.8 

6.9 

wUUm 

D  =  10.8 

7.1 

Ring 

b=4.1,  a=2.6 

I 
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Figure  3.  Distribution  of  patch  antennas  over  a  line,  plane  or  a  cylinder. 


Figure  3  shows  possible  distribution  of  vertically  interconnected  3D  T/R  modules  over 
various  surface  configurations.  These  surfaces  can  actually  be  of  any  shape  provided  the  radius  of 
curvature  of  the  surface  is  not  too  small  compared  to  the  lateral  dimensions  of  the  T/R  module. 
Since  the  overall  thickness  of  the  proposed  vertically  interconnected  3D  T/R  module  is  expected  to 
be  a  few  millimeters,  they  can  easily  be  distributed  over  any  conformal  surface  topology.  These 
properties  may  lead  to  vast  number  of  applications  in  communication  and  radar  systems. 


Patch  antenna 
Phase  shifter 
TR  - module 
LNA 
Mixer 
LO 

IF  amplifier 

Power  Amplifier 

LAYERS 


-..1 IV!  O  d  U  1  8  S 


Figure  2. 4  element  phased  array  antenna  system  with  3D  T/R  modules. 
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Figure  4  shows  the  essential  components  of  a  typical  representative  vertically 
interconnected  3D  T/R  module.  A  substrate  (preferably  a  Si-Silicon  or  Alumina)  is  the  base  of  the 
building  block.  Since  the  substrate  can  be  attached  to  another  heat  conducting  material  such  as  a 
metal,  the  power  amplifier  circuitry  is  processed  on  this  substrate. 


Circuit 

plane 


Syca  r 


Sycar 


Circuit 

plane 


Sycar 


Sycar 


Vertical 

Interconnects 


Transmission 

Lines 


Figure  4.  Representative  components  of  a  3D  module. 

In  addition  to  already  developed  circuit  modules  such  as  LNA,  Mixer,  TR  switch,  so  on, 
there  are  four  major  areas  that  have  to  be  further  developed  in  implementing  a  vertically 
interconnected  3D  T/R  module.  These  are: 

a)  Deposition  and  processing  of  dielectric  layers 

b)  Low  loss  vertical  interconnects 

c)  Transmission  lines  and  passive  circuit  components 

d)  Power  amplifier 

POWER  AMPLIFIER 


There  are  many  possibilities  for  the  implementation  of  the  power  amplifier.  The  complete 
power  amplifier  can  be  processed  as  a  monolithic  integrated  circuit  using  either  HBTs  or  FETs, 
Since  there  is  a  very  successful  development  program  on  thermally  shunted  HBT  power  devices 
(Figure  5)  at  the  Devices  Branch  of  the  Electronics  Division  of  the  Avionics  Laboratory  (AADD)  of 
the  Wright  Laboratory  [12],  it  will  be  an  excellent  opportunity  to  incorporate  these  devices  in  the 
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final  3D  T/R  modules.  This  will  also  allow  close  collaboration  between  the  various  branches  of  the 
Electronics  Division.  The  combined  expertise  of  the  various  branches  will  lead  to  the  development 
of  the  best  circuit  topology  for  maximum  operational  efficiency. 


Figure  5.  Thermally  shunted  power  HBT  developed  at  AADD  of  Avionics  Laboratory 

(M-14). 


Etch  #1 


Etch  #2 


(b) 


f\ _ n 


Metal 

Deposition 


Low  melting  point 


Thermal  SI-GaAs 

Shunt  Substrate 


Conductor 


Figure  6.  Micromachining  steps  in  Silicon  for  attaching  the  HBT  chip.  The 
transistor  shunt  metal  makes  direct  contact  with  Si-substrate. 


As  shown  in  Figure  6,  the  power  amplifier  chip  is  attached  to  the  Si-substrate  by 
micromachining  an  opening  on  the  Silicon  surface  and  flip  bonding  of  the  chip  into  the  opening 
[13,14],  Since  the  dielectric  thickness  of  the  vertical  layers  are  relatively  thin  and  their  thickness 
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may  be  comparable  to  the  overall  height  of  a  thermally  shunted  HBT,  it  is  recommended  that  the 
micromachining  depth  should  be  as  deep  as  the  transistor  height  so  that  the  ensuing  dielectric  layers 
can  be  deposited  uniformly  over  the  substrate  material 

TRANSMISSION  LINES 

There  are  many  transmission  line  configurations  that  can  be  used  in  the  implementation  of 
the  vertically  interconnected  3D  T/R  modules  [15].  Microstrip  lines,  strip  lines,  slot  lines  and 
coplanar  waveguides  are  the  best  possible  candidates  that  can  be  used  with  these  modules.  Because 
of  the  unsymmetric  nature  of  the  slot  line,  it  has  limited  applicability  and  in  general  it  is  used  for 
line  couplings  between  various  layers.  In  many  cases,  circuit  planes  will  be  sandwiched  between 
two  conducting  ground  planes.  Therefore  microstrip  lines  are  replaced  by  stripline  type 
transmission  lines.  If  the  upper  dielectric  layer  of  a  circuit  plane  is  relatively  thick  compared  to  the 
lower  dielectric  layer,  microstrip  like  transmission  lines  can  still  be  implemented  with  a  slight 
modification  to  the  line  characteristic.  In  this  case,  the  strip  is  now  embedded  completely  in  a 
dielectric  material  of  er.. 


(c)  Coplanar  Waveguide 


(d)  Coupled  Coplanar  Waveguide 


Figure  7.  Transmission  line  suitable  for  3D  interconnects . 

In  selecting  transmission  line  structures  best  suited  for  the  proposed  3D  T/R  module,  the 
lines  should  have  low  loss,  easy  to  process  and  above  of  all  they  should  be  less  sensitive  to  the 
thickness  of  the  processed  dielectric  layers.  If  the  thickness  of  the  dielectric  layer  can  be  controlled 
accurately,  the  choice  of  the  transmission  line  structure  will  not  be  critical. 

For  a  circuit  plane  sandwiched  between  two  ground  planes,  striplines  (SL)  and  coplanar 
waveguides  (CPW)  are  the  two  best  transmission  line  configurations  that  can  be  used  in 
implementing  the  distributed  elements  of  a  vertically  interconnected  3D  module. 
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Figure  7  shows  a  stripline  and  a  coplanar  waveguide  in  the  presence  of  both  upper  and 
lower  conducting  planes.  If  the  upper  conducting  ground  plane  is  not  present,  the  stripline 
becomes  a  modified  microstrip  transmission  line  imbedded  completely  in  a  dielectric  material. 
Already  successful  power  coupling  configurations  from  one  layer  to  the  other  are  implemented 
when  the  lines  are  embedded  within  multiple  dielectric  layers  [2], 
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Figure  8.  Conformal  mapping  of  the  CPW  with  top  and  bottom 
ground  planes  inunersed  completely  in  a  dielectric  material. 


In  order  to  verify  the  suitability  of  CPW  for  the  proposed  3D  modules,  the  effect  of  the 
upper  and  lower  ground  planes  on  the  impedance  characteristic  of  the  coplanar  waveguide  is 
investigated.  The  geometry  given  in  Figure  8a  represents  a  typical  coplanar  waveguide  imbedded  in 
a  dielectric  material  sandwiched  between  two  conducting  planes  separated  by  distances  h;  and  h2. 
A  quasi-electrostatic  solution  for  the  CPW  can  be  found  by  using  conformal  transformations  [ lb- 
20] .  The  geometry  given  in  Figure  8a  is  treated  as  two  separate  problems,  one  for  the  upper  and 
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one  for  lower  parts  of  the  transmission  line.  Both  parts  can  be  transformed  into  the  z-plane  through 
the  transformation 


z  —  tanh(— ) 

2  h 

The  transformed  z-plane  coordinates  are  shown  in  Figure  8b.  The  z-plane  is  then 
transformed  into  the  w-plane  using  elliptic  transformations  (Figure  8c).  The  resulting  characteristic 
impedance  can  be  written  as 

/a*,]  ~  K(k,r (n)  <*> 

Wi)  m\y 

Here 

k3  =  tanh(^a/2/21)/tanh(^Z?/2/21) 
k2  =  tanh(^/2/22)/tanh(^/2/z2) 

and  K(k)  is  the  Complete  Elliptic  Integral  of  the  First  Kind  with  argument  k  and  k'  =  V 1  —  k1 . 
Since  the  whole  line  is  imbedded  in  a  dielectric  material,  the  £ef=£r. 


(  W  =  20  pm,  G=2pm  h=50gm) 


Figure  9.  Impedance  variation  as  a  function  of  ground  plane  to  CPW  distance  hfor 

£=2.5. 
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A  FORTRAN  program  is  written  and  the  thickness  of  the  dielectric  layers  between  the 
upper  and  lower  ground  planes  are  taken  to  be  equal.  For  a  gap  of  x4-x3=2  (im  and  width  of 

2x4=20  |im,  the  variation  of  the  line  impedance  as  a  function  of  the  dielectric  thickness  h  is  plotted 

as  shown  in  Figure  9.  The  impedance  is  not  effected  by  h  as  the  dielectric  thickness  becomes 
greater  than  twice  the  overall  separation  distance  of  the  two  side  ground  planes.  Therefore,  for 
dielectric  layer  thickness  greater  than  2(W+2S),  the  CPW  parameters  will  not  be  critically 
dependent  on  small  differences  in  the  thickness  of  the  deposited  dielectric  layers. 

DKT;‘“k-<k  ' 


-ID.  D 


~2W.fr 


-OO.D 

(a) 

Figure  9.  (a)  Layout  and  (b)  simulated  S  parameters  of  a  CPW  3 -section  coupled 

equal  ripple  filter. 

In  order  to  show  the  feasibility  of  utilizing  CPWs  as  distributed  circuit  elements,  a  3  section 
equal  ripple  coupled-line  filter  is  designed  and  simulated.  The  line  parameters  for  the  odd  and  even 
mode  impedances  of  the  coupled  CPWs  are  calculated  beginning  from  a  low  pass  filter  prototype 
using  the  insertion  loss  method.  The  physical  dimensions  of  the  coupled  CPW  lines  are  then  found 
from  the  Line_Calc  program  of  Libra  simulation  software.  These  are  tabulated  in  Table  II  for  h=50 

pm  and  e-2.5.  The  layout  of  the  filter  is  shown  in  Figure  9a. 


Table  II.  Parameters  of  the  coupled  coplanar  waveguide  filter 


Section 

^odd 

^cvcn 

Gap  (C-C) 

Width 

S  (C-G) 

1 

40.2 

88.9 

10 

292.2 

24.6 

2 

47.3 

74.2 

10 

506.6 

118.5 

3 

47.3 

74.2 

10 

506.6 

118.5 

4 

40.2 

88.9 

10 

292.2 

24.6 

bi 

m 

m 

Bi 

m 

m 

m 

H 

vTo  50 
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Using  Libra,  the  circuit  is  simulated  and  the  resulting  IS21I  and  ISnl  are  plotted  in  Figure  9b. 
It  should  be  noted  that  coupled  CPW  line  parameters  used  in  the  Libra  simulations  are  for  a  CPW 
with  the  upper  layer  in  air. 

DIELECTRIC  DEPOSITION. 

One  of  the  crucial  components  of  a  vertically  interconnected  3D  system  is  the  choice, 
deposition  and  processing  of  the  dielectric  material.  There  are  many  different  dielectric  materials 
that  are  used  in  the  processing  of  semiconductor  devices.  Table  III  lists  the  properties  of  some  of 
the  dielectric  materials  used  in  the  implementation  of  3D  interconnects.  The  parameters  in  this  table 

are  for  a  50  Q  microstrip  transmission  line.  Gold  metal  is  used  to  calculate  the  conductor  losses. 


Table  11.  Properties  of  dielectric  materials  used  in  3D  interconnected  modules. 


BBC 

Si)2 

Sycar 

Polyimide 

Deposition 

Process 

Spin  coat  and 

bake 

CVD 

spin  coat  and 

bake 

spin  coat  and 

bake 

Dielectric 

Constant 

2.7 

4.0 

2.6 

3. 1-3.6 

Tan  5 

0.004 

0.03 

0.003 

0.001 

Possible 

26 

9 

25 

2.5-25 

thickness  (pm) 

LossAg\dielectric 

0.093 

0.712 

0.070 

0.023 

;dB)/cmjconductive 

0.812 

2.307 

0.838 

7.327-0.876 

Dielectric  materials  to  be  used  in  vertically  interconnected  microwave  and  millimeter  wave 
modules  should  have  some  stringent  mechanical,  chemical  and  electrical  properties.  These  can  be 
listed  as: 

a)  deposition  of  uniform  and  controlled  thickness  layers 

b)  adhesion  to  metal  (gold)  surface 

c)  adhesion  of  metal  (gold)  on  to  the  dielectric 
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d)  low  shrinkage  during  curing 

e)  compatibility  with  photoresists 

f)  prossesable  with  wet  and  dry  etching  techniques 

g)  maintain  its  properties  over  wide  range  of  temperature  operation. 

Polyimide  is  one  of  the  widely  used  dielectric  materials  in  3D  interconnect  circuitry.  In  this 
proposal  a  new  dielectric  material  referred  to  as  ‘Sycar’  developed  by  Hercules  Inc.  will  be  used 
due  to  its  excellent  moisture  resistance,  electrical  properties,  ionic  purity  and  ease  of  handling.  It  is 
applied  by  spin  coating  and  curing  at  200°  C  [21]. 

It  is  very  important  that  during  the  deposition  process,  the  thickness  of  the  dielectric  layer 
should  be  precisely  controlled  to  provide  the  desired  thickness  uniformity.  Any  variations  in  the 
thickness  of  the  layers  will  prevent  successful  implementation  of  the  circuit  elements  especially 
those  of  the  distributed  line  elements  whose  parameters  are  likely  to  depend  on  the  dielectric 
thickness.  As  has  been  discussed  above,  distributed  line  elements  such  as  CPW  are  recommended 
for  the  implementation  of  the  transmission  lines  since  they  are  less  dependent  on  the  dielectric 
thickness  provided  their  lateral  dimensions  are  small  compared  to  the  thickness  of  the  dielectric 
material 

Mechanical  properties  of  the  dielectric  material  also  play  a  very  important  role  in  the 
implementation  of  the  3D  interconnects.  Shrinkage  during  processing,  stability  of  the  material, 
adhesion  to  metal  as  well  as  adhesion  of  metal  on  the  dielectric,  and  pealing  strength  of  the  metal 
over  the  dielectric  are  some  of  the  properties  that  have  to  thoroughly  investigated. 

Compatibility  of  photoresists  with  the  dielectric  material  and  the  type  of  etching  process 
such  as  wet  or  dry  etching  are  also  major  steps  in  the  successful  implementation  of  the  3D 
interconnects.  It  is  possible  that  there  may  be  a  need  for  mechanical  surface  removal  by  lapping  and 
polishing.  Thus  the  dielectric  material  should  also  be  robust  enough  to  allow  this  type  of  abuse. 


VERTICAL  INTERCONNECTS 

Another  major  component  of  a  3D  T/R  module  is  routing  of  microwave  or  millimeter  waves 
from  one  level  to  another.  In  some  of  the  3D  interconnects  used  in  MMIC  processing,  novel 
coupling  schemes  between  various  layers  are  already  used.  If  a  complete  T/R  module  as  shown  in 
Figure  1  is  to  be  implemented,  several  ground  planes  will  separate  various  circuit  planes  from  each 
other.  Therefore,  it  will  be  necessary  to  couple  signal  from  one  layer  to  the  other  by  means  of 
vertical  posts  (vias).  IF  CPWs  are  used,  it  is  possible  that  parallel  transmission  line  TEM  modes 
may  also  be  excited  in  addition  to  CPW  TEM  modes.  To  prevent  these  unwanted  modes,  shorting 
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connections  between  the  top  and  lower  ground  planes  and  the  side  ground  planes  of  the  CPW  have 
to  be  made.  These  will  require  additional  vertical  posts. 


Side  view  Top  view 


Figure  1 0.  Vertical  Interconnecting  posts  between  layers.  Additional  posts  to  prevent 
possible  parallel  plate  TEM  modes  are  also  shown. 

Figure  10  shows  the  connecting  posts  (vias)  from  one  layer  to  the  next.  Even  though  the 
posts  are  shown  as  solid  uniform  cylinders,  their  shape  may  be  intentionally  modified  to  decrease 
the  return  loss.  Also,  the  cross  section  of  the  posts  may  not  be  uniform  vertically  as  a  result  of  the 
processing  steps  in  the  electrodeposition  of  the  posts.  In  either  case,  extensive  simulations  and 
refined  processing  steps  will  be  necessary  to  find  the  best  possible  coupling  configuration.  In 
Figure  10,  additional  posts  connecting  the  ground  planes  and  the  side  ground  electrodes  of  the 
CPWs  are  also  shown. 
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a)  Deposit  metal  layer  for 
transmission  line  or 
ground  plane  and  process 

b)  Deposit  a  thin  dielectric 
layer 

c)  Open  via  (post)  holes 

d)  Electrodeposit  metal 
posts  to  required  height 

e)  Spin  coat  and  process 
dielectric 

f)  Mechanically  planarize 
surface 

(a) 


a)  Deposit  metal  layer  for 
transmission  line  or 
ground  plane  and 
process 

b)  Deposit  dielectric  layer 
to  the  required  thickness 

c)  Open  via  (post)  holes 
in  dielectric 

d)  Electrodeposit  metal 
posts  to  required  height 


(b) 


Figure  11.  Two  alternate  processes  for  vertical  posts. 

Figure  11  shows  two  possible  multi  layer  processing  steps  for  realizing  dielectric  and 
vertical  post  deposition.  The  process  shown  in  Figure  1  la  is  easier  to  implement  and  is  expected  to 
generate  slightly  non-uniform  vertical  posts.  This  process  involves  first  the  deposition  of  a 
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transmission  line  or  a  ground  plane.  A  thin  dielectric  layer  is  then  deposited  over  the  metal.  Using 
photoresist,  the  post  locations  and  sizes  are  exposed.  Metal  posts  are  then  electrodeposited  to  the 
required  height.  The  photo  resist  is  then  removed  and  a  thick  dielectric  layer  is  spin  coated  and 
cured.  This  process  will  form  mesas  over  the  posts.  Mechanical  lapping  may  be  required  to 
planarize  the  upper  surface  of  the  dielectric  material  to  the  depth  of  the  metal  posts.  Even  though 
this  process  seems  to  be  attractive  in  the  overall  implementation  of  the  3D  interconnects,  required 
mechanical  lapping  may  not  be  desirable. 

The  second  process  shown  in  Figure  1  lb  begins  with  metal  deposition  for  the  transmission 
line  or  a  ground  plane.  The  dielectric  material  is  then  spin  coated  and  cured  to  the  required  final 
thickness.  Photoresist  is  then  used  to  locate  the  post  locations.  Through  a  wet  or  dry  etching 
process,  the  dielectric  material  is  removed  from  these  openings  for  via  holes.  Gold  metal  is  electro- 
deposited  in  these  holes  to  the  height  of  the  dielectric  layer.  This  process  requires  anisotropic 
etching  of  vertically  uniform  via  holes  in  a  thick  dielectric  material.  If  this  can  be  achieved,  uniform 
cross  sectional  electrodeposition  of  metal  posts  will  be  possible. 


MODELING  AND  SIMULATIONS 

In  order  to  implement  an  optimum  vertically  interconnected  3D  T/R  module,  extensive 
component  and  circuit  simulations  are  necessary. 

Even  though  circuit  simulation  software  such  as  Libra,  MDS  and  Compact  are  available, 
they  do  not  include  some  of  the  modified  circuit  topology  that  will  be  used  with  the  3D  T/R 
modules.  For  example,  the  CPW  model  used  in  Libra  is  for  a  CPW  with  the  upper  surface  exposed 
to  air.  In  the  CPWs  that  will  be  used  in  the  3D  modules,  the  electrodes  will  be  completely 
immersed  in  a  dielectric  material  and  there  will  also  be  an  upper  ground  plane  in  addition  to  the 
conventional  lower  ground  plane.  In  the  quasistatic  solution  given  in  Equation  1  for  the  CPW,  both 
complete  dielectric  immersion  and  the  presence  of  the  upper  and  the  lower  ground  planes  are 
included.  Since  Libra  does  not  have  this  type  of  circuit  element  in  the  Library,  either  a  new  library 
element  has  to  be  added  to  the  simulator,  or  a  new  simulation  procedure  has  to  be  developed.  This 
is  also  true  for  the  coupled  CPWs.  The  filter  simulations  presented  in  Figure  9  are  made  using  the 
conventional  CPW  models  that  are  available  in  Libra.  The  response  of  the  filter  will  be  modified  by 
the  presence  of  the  upper  ground  plane  as  well  as  complete  immersion  of  the  CPW  in  the  dielectric 
medium.  The  lack  of  these  elements  in  the  present  simulation  software  libraries  will  generate  large 
simulation  errors  especially  when  resonant  circuit  elements  are  used  in  a  circuit. 

Some  of  the  simulation  software  also  lack  detailed  device  modeling  especially  for  the  power 
devices  [19,20],  Since  temperature  effects  modify  the  operating  characteristics  of  power  devices 
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such  as  HBTs,  new  nonlinear  models  has  to  be  introduced  into  the  software  or  new  device  files 
have  to  generated  and  used  with  SPICE  like  programs. 

There  are  also  many  electromagnetic  simulators  available.  Among  these,  Microwave  Lab 
and  Sonnet  are  the  two  software  which  are  exclusively  dedicated  to  modeling  planar  circuit 
elements.  CPWs  and  the  vertical  interconnects  will  be  simulated  using  these  software.  Even  though 
these  programs  require  large  computer  memory  to  run  complicated  geometry,  they  provide  both 
electric  and  magnetic  field  distributions,  current  distributions  in  the  conductors  as  wells  the  S 
parameter  data  at  a  given  frequency.  These  are  essential  in  the  successful  implementation  of  various 
circuit  components  especially  in  finalizing  the  shapes  and  locations  of  the  interconnects  for 
minimum  return  loss. 

It  can  be  concluded  that  successful  implementation  of  3D  T/R  modules  will  require 
extensive  and  careful  device  and  circuit  modeling  and  simulations  in  addition  to  developing  novel 
processing  techniques. 


SHORT  TERM  GOALS 

The  short  term  goals  of  this  research  proposal  is  to  develop  technology  for  vertically 
interconnected  MMIC.  The  major  emphasis  in  this  phase  of  this  project  is  to  develop  a  reliable  and 
controllable  dielectric  deposition  process  so  that  necessary  number  of  dielectric  layers  can  be 
deposited. 

The  initial  phase  of  the  program  will  be  concentrated  in  the  deposition  of  the  dielectric  and 
subsequent  metal  layers.  The  uniformity  as  well  as  the  reproducibility  of  the  dielectric  layers  are 
very  important.  Thickness  control  and  adhesion  to  various  surfaces  such  as  Silicon  and  a  metal 
surface  are  major  parts  of  the  development  program.  If  a  successful  dielectric  deposition  process  is 
developed  and  the  thickness  of  the  deposited  layers  can  be  controlled  to  within  a  micrometer,  the 
ensuing  circuit  designs  will  be  relatively  simplified. 


WORK  SCOPE: 

A  key  technology  will  be  developed  for  3D  MMIC.  Emphasis  will  be  placed  on  the 
deposition  of  uniform  and  controllable  dielectric  layer  thickness  for  vertical  interconnects.  Various 
processing  technologies  will  be  developed  for  uniform  via  holes  and  associated  metal  posts. 
Coplanar  waveguides  and  other  transmission  lines  sandwiched  between  layers  will  be 
implemented. 
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Work  Period:  1997  (1  year) 

Total  Funding:  (  $  25  K,  detailed  budget  is  attached) 

Man  Power:  1  professor  and  1  graduate  student 

The  detailed  worksteps  are  as  follows: 

-  Sycar  will  be  initially  used  as  a  dielectric  material.  Other  possible  dielectric  materials  such 
as  polyimide  and  BCB  will  be  investigated. 

-  Sycar  multi-layer  process  will  be  developed  with  emphasis  on 

-  thickness  control 

-  uniformity  of  thickness 

-  compatibility  with  wet  and  dry  etching  techniques 

-  Multi-layer  vertical  interconnect  technology  will  be  developed  with  emphasis  on 

-  via  holes 

-  gold  interconnect  posts 

-  metal  dielectric  interface 

-  Extensive  study  of  the  best  combination  of  waveguide  structures  for  vertically 
interconnected  multi-layer  circuit  formation  will  be  studied.  Emphasis  will  be  concentrated  on 
Coplanar-Waveguide  because  of  less  dependence  on  thickness  control.  Microstrip,  stripline,  et  al. 
will  be  studied  provided  uniform  thickness  control  is  achieved  during  the  dielectric  deposition 
process. 

-  Passive  components  (capacitor  and  inductor)  will  be  developed. 

-  Electromagnetic  simulation  of  vertically  interconnected  transmission  lines  between  layers 
will  be  extensively  studied  using  existing  software  (MicroWave  Lab)  and  in-house  developed 
numerical  techniques. 

LONG  TERM  GOALS 

Long  term  goals  of  the  project  will  be  to  implement  complete  T/R  modules  and  test  them  in 
a  2x2  array  for  their  applicability  in  a  conformal  phased  array  antenna  system.  This  challenging 
task  will  be  achieved  through  extensive  theoretical  analysis,  simulations  and  experimental  testing  of 
various  components  and  circuit  modules. 
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PROGRAM  SCHEDULE 


Month 

1 

2 

mm 

5 

6 

mm 

8 

9 

10 

11 

12 

Process  development 
-Dielectric  Layer 

Material  Study 
-Layer  Deposition 
-Metalization 
-Via  Process 

Passive  component 
-design 
-fab  and  test 

Electromagnetic  and  circuit 
Simulation 
-multi-layer  design 
-coupling  study 
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DEVELOPMENT  OF  A  PROTOTYPE  TO  TEST  AND  DEMONSTRATE  THE 
MANUFACTURING-ORIENTED,  DESIGN-DIRECTED  COST  ESTIMATION  (MODDCE)  FRAMEWORK 


Allen  G.  Greenwood 
Associate  Professor 
Department  of  Industrial  Engineering 
Mississippi  State  University 

Abstract 

This  report  discusses  the  development  and  application  of  a  software  prototype  that  demonstrates  the 
application  of  the  MODDCE  (Manufacturing-Oriented,  Design-Directed  Cost  Estimation)  framework  and 
provides  a  proof  of  concept  for  using  an  object-oriented  approach  to  implement  the  framework.  The  MODDCE 
framework  defines  the  modeling  and  analysis  progression  from  product  features  to  an  enterprise  view  of  the 
production  of  a  product.  The  most  design-relevant  portion  of  the  framework  is  implemented  through  an  object- 
based  representation  of  an  integrated  product/process  cost  assessment  environment.  This  representation  is  based 
upon  the  premise  that  cost  assessment  must  concurrently  consider  three  primary  design  dimensions,  or  what 
may  be  referred  to  as  the  product  triad  —  form,  material,  and  process.  The  prototype  is  referred  to  as  the 
OCEANS  (Object-based  Cost  Assessment  and  Estimation  for  desigN  Support)  system  and  its  initial  focus  is  on 
the  design  of  aircraft  structures.  The  OCEANS  prototype  is  a  critical  first  step  towards  a  comprehensive 
software  system  and  application  environment  that  effectively  integrates  cost  assessment  into  the  design  process. 
In  addition,  the  prototype  provides  an  important  means  to  identify,  discuss,  and  address  a  variety  of  cost/design 
issues  with  industry  IPTs  (Integrated  Product  Teams)  and  product  designers. 
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DEVELOPMENT  OF  A  PROTOTYPE  TO  TEST  AND  DEMONSTRATE  THE 
MANUFACTURING-ORIENTED,  DESIGN-DIRECTED  COST  ESTIMATION  (MODDCE)  FRAMEWORK 


Allen  G.  Greenwood 


Introduction 

In  order  to  develop  affordable  products,  designers  and  Integrated  Product  Teams  (IPTs)  must  place 
cost/manufacturability  assessment,  along  with  performance,  at  the  core  of  the  Integrated  Product  Process 
Development  (IPPD)  or  concurrent  engineering  process.  Designers  and  IPTs  need  to  effectively  address  both 
product  and  process  issues  early  in  a  product’s  design;  i.e.,  they  must  have  tools  available  that  are  sensitive  to 
process  parameters  as  well  as  product  parameters.  Cost,  schedule,  and  risk  must  be  assessed  during  design  in 
manner  similar  to  the  way  aerodynamics,  weight,  etc.  are  assessed.  Therefore,  there  is  a  strong  need  for 
advanced  design  technologies  to  effectively  support  the  transformation  of  customer  requirements  into  a  product 
by  providing  cost/manufacturability  measures  of  the  design  and  concurrently  considering  the  production 
process  in  conjunction  with  the  product’s  intended  form  and  material. 

Unfortunately,  such  cost/manufacturability  assessment  tools  are  not  readily  available  today.  A  review 
by  Thomas  [1]  of  papers  published  in  major  journals  in  the  past  five  years,  resulted  in  the  following 
conclusions:  (1)  research  in  manufacturing  cost  estimation  has  involved  rather  few  people  and  as  a  result  there 
are  no  broadly  applicable  mathematical  models,  (2)  there  is  widespread  dissatisfaction  with  the  way  overhead 
or  indirect  costs  are  handled,  and  (3)  the  lack  of  broadly  applicable  cost  models  leads  to  few  efforts  directed 
towards  their  integration  into  CAD/IPPD  systems.  Greenwood  [2]  adds  the  following  issues  to  the  list  of 
concerns  with  the  current  state  of  manufacturing  cost  estimating  within  the  IPPD  process:  (1)  Cost  model 
drivers  need  to  be  more  closely  related  to  the  characteristics  of  manufacturing  processes  and  operations.  The 
process  needs  to  be  designed  as  much  as  the  product;  all  too  often  the  product  is  designed  and  then  the  process 
(manufacturing)  must  be  designed  to  accommodate  it  or  the  product  must  be  redesigned  to  accommodate  the 
process.  (2)  Cost  model  drivers  need  to  be  more  closely  related  to  product  features.  (3)  Manufacturing  process 
design  and  resulting  costs  need  to  be  considered  earlier  in  the  design  of  the  product.  (4)  Product  and  process 
design  must  be  sensitive  to  the  existing  or  planned  manufacturing  state,  especially  in  an  agile  environment;  i.e. 
a  design  may  change  based  on  current  factory  utilization.  Cost  models  need  to  capture  the  dynamics  and 
stochastic  nature  of  manufacturing.  (5)  The  relationship  between  product/process  costs  and  indirect  costs  need 
to  be  better  defined  and  understood.  Indirect  costs,  both  manufacturing  and  non-manufacturing,  need  to  be 
“modeled”  so  that  they  can  be  addressed  during  design.  (6)  Legacy  cost  system  data  are  often  not  tracked  at  the 
level  of  detail  needed  in  design.  (7)  Legacy  cost  system  data  are  often  based  on  processes  and  methods  that  are 
no  longer  used. 

In  order  to  design  affordability  into  products,  manufacturing  and  cost  considerations  must  be 
addressed  early  in  the  design  process.  Since  this  ability  is  lacking  in  today  s  design  environment,  it  is  important 
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to  freshly  examine  how  costing  should  be  done  in  order  to  support  design  and  new  costing  methodologies  must 
themselves  be  designed  to  fit  into  tomorrow’s  environment.  The  prototype  described  in  this  report  is  a  step 
towards  meeting  these  needs.  It  is  entitled  OCEANS  (Object-based  Cost  Estimation  and  Assessment  for  desigN 
Support).  The  prototype  application  is  based  on  the  most  design-relevant  portion  of  the  Manufacturing- 
Oriented,  Design-Directed  Cost  Estimating  (MODDCE)  framework  [2,  3].  The  framework  defines  the  modeling 
and  analysis  progression  from  product  features  to  an  enterprise  view  of  the  production  of  a  product.  The 
OCEANS  prototype  is  also  based  on  an  object-based  approach  [4]  that  was  developed  to  provide  a  structure  for 
integrating  product  and  process  considerations  early  in  the  design  phase  of  a  product’s  life  cycle.  Both  the 
MODDCE  framework  and  the  object-based  structure  are  defined  below. 

The  MODDCE  Framework 

The  OCEANS  prototype  is  based  upon  the  Manufacturing-Oriented,  Design-Directed  Cost  Estimation 
(MODDCE)  framework  [2,  3],  This  framework  defines  the  modeling  and  analysis  progression  from  product 
features  to  an  enterprise  view  of  the  production  of  a  product.  As  illustrated  in  Figure  1,  it  is  composed  of  five 
levels  that  progress  from  a  hierarchical  decomposition  of  a  product’s  features  through  three  intermediate  levels 
to  an  enterprise  model  that  links  product  features  with  manufacturing  and  non-manufacturing  entities 
throughout  the  enterprise.  The  highest  or  enterprise  level  provides  the  most  comprehensive  analysis  capabilities 
for  assessing  the  effect  of  changes  in  product  and  process  design  on  the  enterprise  and  the  resulting  effect  of 
those  changes  on  “total”  cost.  As  shown  in  Figure  1,  each  level  is  linked  to  a  set  of  design  parameters  that  drive 
the  types  of  costs  that  are  considered  at  that  level.  The  dashed-line  box  in  the  left-hand  portion  of  Figure  1  is 
the  portion  of  the  framework  where  a  product’s  design  evolves,  in  terms  of  what  it  looks  like,  what  it  is  made 
of,  and  how  it  is  constructed.  Hence,  the  focus  of  this  research  and  development  effort  is  within  this  design¬ 
relevant  portion  of  the  framework. 

The  progression  from  one  level  of  analysis  to  another  in  Figure  1  is  accomplished  in  stages;  the 
numbered  arrow  lines  in  Figure  1  represent  stages.  For  example,  in  order  to  move  from  the  hierarchical 
decomposition  of  a  product’s  features  to  the  corresponding  manufacturing  elements  that  create  those  features,  a 
stage  (Stage  1  in  Figure  1)  maps  or  associates  features  with  transformation  elements.  The  manufacturing 
elements  may  be  a  specific  manufacturing  process  or  a  general  process  family. 

The  second  stage  combines  manufacturing  elements  into  operations.  Operations  define  specific 
equipment,  labor  skill  requirements,  and  tooling  and  consider  precedence  among  elements  and  operations.  The 
operations  level  provides  the  first,  yet  limited,  opportunity  to  make  product/process  cost  trades. 
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Figure  1:  MODDCE  framework  provides  the  basis  for  “designing”  and  integrating  costing  methodologies  that 
link  product  features  to  an  enterprise  view  of  the  production  of  the  product. 

The  third  stage  transforms  discrete  process  operations  into  system-level  manufacturing  process  objects. 
It  is  at  this  level,  and  through  the  use  of  the  objects,  that  manufacturing  process  operations  are  placed  in  the 
context  of  an  actual  production  environment;  i.e.,  the  product  is  integrated  into  the  fabric  of  a  manufacturing 
environment.  As  a  result,  the  objects  become  the  basis  for  the  design  of  production  operations  and  facilities  by 
considering  such  things  as  product  movement,  storage  and  handling  issues,  resource  availability,  interference 
with  other  products,  grouping  of  elements  into  cells,  etc.  Process  objects  are  dynamic  (time  sensitive)  and 
stochastic  (the  ability  to  incorporate  random  events)  —  both  of  which  are  common  characteristics  of 
manufacturing  systems,  but  are  seldom  considered  in  the  product  design  phase.  It  is  only  through  the  dynamic 
and  stochastic  capabilities  of  the  manufacturing  process  object  that  a  system’s  performance  can  realistically  be 
evaluated  and  virtual  manufacturing  can  truly  be  achieved.  It  is  only  through  manufacturing  process  objects  and 
the  interaction  among  objects  that  manufacturing  performance,  including  costs,  can  be  evaluated.  System-level 
analysis  provides  a  means  to  identify  bottlenecks  and  other  potential  problems  before  the  product  is  committed 
to  production.  It  is  only  at  this  level  that  the  impact  of  a  product’s  characteristics  on  a  production  process  can  be 
assessed  and  evaluated.  This  is  especially  important  in  an  agile  environment  or  for  accommodating  small 
batches  of  dissimilar  products  (e.g.  items  for  the  defense  industry  being  produced  on  commercial  lines).  In  fact, 
the  design  of  the  product  may  change  based  on  the  current  state  of  the  manufacturing  facility,  e.g.  available 
resources,  supplier  constraints,  etc. 

The  fourth  stage  moves  modeling  and  analysis  to  the  fifth  and  highest  level  —  the  enterprise  level. 
Enterprise-level  costing  extends  design  sensitivity  beyond  direct  manufacturing  costs  to  include  non- 
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manufacturing  and  indirect  costs.  The  impact  of  changes  in  product/process  design  on  indirect  entities  in  the 
enterprise  is  assessed,  and  conversely,  the  impact  of  changes  in  indirect  entities  on  manufacturing  processes, 
and  hence  on  the  product,  are  also  assessed.  For  example,  specification  of  a  new  material  during  product  design 
will  not  only  affect  operations  in  terms  of  the  type  of  equipment  used,  labor  skills,  tooling,  scrap  rate,  etc.  but 
may  also  affect  numerous  indirect  entities  in  the  organization,  including  training,  purchasing,  warehousing,  etc. 
Normally  these  effects  are  assumed  to  be  captured  in  the  application  of  very  general  indirect  or  overhead  rates. 
However,  enterprise-level  costing  permits  a  more  realistic  assessment  of  the  impacts  on  indirect  activities.  In 
order  to  realize  enterprise-level  cost  estimation,  the  firm’s  business  processes  must  be  identified,  modeled,  and 
linked  to  the  appropriate  manufacturing  process  object(s).  In  this  case,  enterprise  is  defined  in  its  broadest 
context  and  includes  all  non-manufacturing  entities  in  the  company  (administration,  purchasing,  etc.),  as  well  as 
external  entities  such  as  customers  and  suppliers. 

The  Object-based  Foundation 

Object-based  technologies  were  investigated,  as  part  of  another  research  project  [4],  and  found  to  offer 
a  strong  promise  for  facilitating  the  development  and  use  of  an  integrated  costing  environment  in  support  of 
concurrent  product  and  process  design.  Object  orientation  provides  a  more  natural  and  easier  means  —  as 
compared  to  more  traditional  systems  analysis,  design,  programming,  and  database  approaches  —  to  represent 
and  model  complex  systems  from  individual  components.  In  object-oriented  applications,  data  objects  that 
represent  real-world  entities  are  often  the  focus;  whereas,  applications  that  employ  a  conventional  programming 
approach  are  often  procedure  focused.  Therefore,  from  a  modeling  perspective,  as  noted  by  Khoshaflan  and 
Abnous  [5],  decomposition  of  a  system  from  an  object-oriented  perspective  is  more  intuitive  and  involves  well- 
defined  behaviors  and  encapsulated  algorithms;  in  contrast,  procedure-based  decomposition  is  unnatural,  in 
most  cases. 

The  OCEANS  prototype  is  based  upon  the  object-based  representation  of  an  integrated 
product/process  cost  assessment  environment  that  was  developed  in  [4],  This  representation  is  based  upon  the 
premise  that  cost  assessment  must  concurrently  consider  three  primary  design  dimensions,  or  what  may  be 
referred  to  as  the  product  triad  —  form,  material,  and  process.  That  is,  in  order  to  adequately  cost  an  item,  one 
must  know  what  it  looks  like,  what  it  is  made  of,  and  how  it  is  produced.  The  last  element  is  the  one  that  has 
received  the  least  consideration  in  the  past.  However,  in  order  to  design  and  build  affordable  products  and  to 
make  meaningful  trades  among  alternatives,  all  three  elements  must  be  an  integral  part  of  the  design  process 
and  “cost”  is  a  natural  metric  to  tie  them  together.  While  the  amount  and  level  of  information  available  early  in 
the  design  phase  may  be  limited,  manufacturing  process  still  must  be  addressed  and  assumptions  must  be  made 
as  to  how  the  product  will  be  produced. 

The  object-based,  product-triad  approach  is  represented  conceptually  by  the  triangular  structure  in  the 
lower  portion  of  Figure  2.  The  three  primary  design  dimensions,  or  product  triad  -  form,  material,  and  process 
-  are  defined  as  object  hierarchies  and  are  represented  as  circles  in  the  figure.  Secondary  objects,  shown  as  the 
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intersection  of  two  primary  objects  and  represented  by  ovals  in  the  object  structure  are  complex  objects,  in  that 
they  inherit  properties  from  two  primary  objects.  For  example,  the  Form-Material  object  inherits  properties 
from  both  the  Form  and  Material  hierarchies.  There  is  a  second  type  of  complex  object,  shown  as  the 
intersection  of  all  three  primary  objects  and  represented  by  the  triangle  in  the  center  of  the  object  structure.  This 
Form-Material-Process  (F-M-P)  object  inherits  properties  from  all  three  hierarchies.  This  object  is  especially 
important  since  it  provides  the  basis  for  cost  assessment;  i.e.  costing  cannot  be  completed  until  the  design  has 
considered  the  item’s  form,  material,  and  manufacturing  process. 


Item  structure 


Cost  structure 


* 

[  Item  ] 


[  Cost  ] 


Figure  2:  Cost  assessment  based  on  a  Form-Material-Process  instance. 


Two  additional  elements  are  required  in  order  to  complete  a  cost  assessment.  First,  the  item  being 
designed  must  be  considered  in  its  context  as  a  part  of  some  larger  system  or  product,  as  illustrated  by  the  item 
structure  in  the  upper  left  portion  of  Figure  2.  The  item  structure  primarily  provides  programmatic  information, 
such  as  quantity,  schedule,  etc.  These  properties  are  often  important  factors  in  design  since,  for  example,  low 
production  rate  or  quantity  may  preclude  an  item  from  being  produced  by  certain  processes  because  they  would 
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result  in  prohibitive  costs  and  become  unaffordable.  Cost  structure  is  the  second  element  that  must  be  combined 
with  the  Form-Material-Process  design  definition.  The  cost  structure  as  shown  in  the  upper  right  portion  of 
Figure  2,  defines  the  types  of  cost  that  are  addressed  in  the  analysis. 

Approach  to  Prototype  Development 

The  OCEANS  prototype  is  a  critical  first  step  towards  a  comprehensive  software  system  and 
application  environment  that  effectively  integrates  cost  assessment  into  the  design  process.  It  was  developed  in 
order  to  demonstrate  the  application  of  the  MODDCE  framework  and  provide  a  proof  of  concept  for  using  an 
object-based  approach  in  order  to  implement  the  framework.  In  addition,  the  prototype  was  developed  in  order 
to  provide  an  important  means  to  identify,  discuss,  and  address  a  variety  of  cost/design  issues  with  industry 
IPTs  and  designers.  The  demonstration  and  application  of  this  tool  in  an  industrial  environment  will  result  in  a 
better  understanding  of  the  types  of  data,  knowledge,  models,  interfaces,  etc.  that  are  required  in  order  to 
represent,  process,  and  display  information  in  a  manner  that  fosters  the  concurrent  consideration  of 
product/process  cost  during  design. 

The  OCEANS  system  is  a  fully-functional  software  application;  e.g.,  it  is  built  upon  a  sound  database 
structure,  employs  an  intuitive  interface,  provides  meaningful  real-time  feedback  to  the  user  on  the  cost  impact 
of  design  changes,  and  provides  an  efficient  means  for  maintaining  the  data.  Of  course,  being  a  prototype,  the 
system,  is  limited  in  the  breath  and  depth  of  its  application  domain.  However,  these  limitations  are  by  design, 
since  it  would  be  premature  to  fully  develop  an  application  until  many  of  the  critical  issues  are  better 
understood,  modeled,  and  represented. 

The  data  used  to  populate  the  prototype  is  a  small  portion  of  the  information  provided  in  the  multi¬ 
volume  Manufacturing  Cost/Design  Guide  (MC/DG)  that  was  developed  by  Battelle,  as  part  of  the  Air  Force’s 
Integrated  Computer  Aided  Manufacturing  (ICAM)  program  [6,  7]. 

A  portion  of  this  research  involved  an  investigation  into  the  type  of  software  that  should  be  used  to 
prototype  and  develop  the  application.  Due  to  the  growing  dominance  of  the  Microsoft  Windows  operating 
system  and  the  plethora  of  development  languages  and  applications  that  it  supports,  it  was  decided  to  develop 
the  OCEANS  system  in  a  Windows  95/NT  environment. 

A  variety  of  off-the-shelf  applications  were  reviewed  for  inclusion  in  OCEANS.  Several  potentially 
attractive  applications  were  strongly  considered.  Knowledge  Base  Engineering’s  (KBE)  Integrated  Knowledge 
Environment  ( IKE)  possessed  some  of  the  capabilities  needed  to  develop  OCEANS.  However,  KBE  was 
developing  another  product,  Object  cZar  (025),  which  would  provide  many  more  of  the  capabilities  required  by 
OCEANS.  For  example  OZ  provides  a  means  to  easily  incorporate  flexible  form,  material,  and  process 
hierarchies,  as  well  as  cost  and  item  structures.  In  addition,  OZ  provides  an  effective  means  to  create  and 
manage  “collections”  of  objects.  Therefore,  it  was  decided  that  it  would  be  more  effective  in  the  long  run  to 
wait  for  a  more  mature  release  of  OZ.  Incorporation  of  technologies  from  OZ,  expected  during  the  first  quarter 
of  1998,  will  greatly  enhance  the  next  version  of  OCEANS. 
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Cognition  Corporation’s  Cost  Advantage  ( CA )  provides  strong  manufacturing  process  cost  modeling 
capabilities.  While  Cost  Advantage  is  a  mature  product  in  the  UNIX  environment,  introduction  of  the  Windows 
version  has  been  continually  postponed.  However,  its  Windows  NT  version  seems  certain  to  be  released  in  the 
first  quarter  of  1998.  Therefore,  the  next  version  of  OCEANS  will  most  certainly  be  greatly  enhanced  by 
utilizing  the  many  capabilities  of  Cost  Advantage.  A  completely  new  Cognition  product,  Knowledge  Center,  is 
expected  to  be  released  in  early  1998  and,  based  on  preliminary  information,  it  will  enhance  the  OCEANS 
system’s  object  structure  as  well  as  its  data/knowledge  representation  and  processing. 

Since  none  of  the  above  applications  were  available  in  time  to  develop  the  prototype,  the  initial 
version  of  the  OCEANS  system  was  developed  in  Visual  C++  with  Microsoft  Access  providing  the  relational 
database  capabilities.  However,  the  next  version  of  OCEANS  will  rely  less  on  C++  and  more  on  the  off-the- 
shelf  capabilities  of  OZ,  Cost  Advantage,  and  Knowledge  Center . 

User  "view"  of  the  oceans  system 

The  description  of  the  OCEANS  prototype  is  provided  in  terms  of  two  “views”  of  the  application.  The 
primary  or  user  view  is  the  environment  in  which  the  designer  or  IPT  interacts  with  the  system.  The  secondary 
or  administration  view  is  concerned  with  updating  existing  information  and  entering  new  information  into  the 
system.  Screen  shots  from  each  of  the  views  are  used  to  describe  the  capabilities  of  the  OCEANS  system. 

The  basic  interface  screen  for  the  user  view  is  shown  in  Figure  3.  The  user  view  contains  six  user  input 
“tabs”  -  material,  form,  process,  DICE,  Assembly,  and  draft  image  -  that  are  used  to  specify  a  design  for 
costing  purposes.  The  “draft  image”  tab  in  Figure  3  shows  a  general  picture  of  the  product  that  is  being 
designed.  In  this  case,  a  fuselage  panel  is  being  designed.  It  is  composed  of  three  basic  items  —skin  panel, 
stringer(s),  and  frame(s). 


Figure  3  -  Basic  user  view  showing  the  product  that  is  being  designed. 
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The  three  primary  tabs  in  the  OCEANS  system  -  Material,  Form,  and  Process  -  correspond  to  the 
“product  triad”  that  was  described  earlier  in  the  “The  Object-based  Foundation”  section.  This  follows  the 
premise  that  an  item’s  cost  cannot  be  estimated  until  it  has  been  determined  what  the  item  looks  like,  what  it  is 
made  of,  and  how  it  will  be  produced.  It  should  be  noted  that  the  properties  that  are  displayed  or  controlled  on 
each  tab  represent  a  minimal  set  of  properties  to  cost  an  item.  These  properties  are  based  on  the  cost  estimating 
relationships  provided  in  [6,  7],  This  initial  version  of  OCEANS  is  based  on  a  literal  implementation  of  the 
MC/GD.  Future  versions  would  employ  a  flexible  structure  that  would  incorporate  a  variety  of  costing 
approaches  and  data. 

Figure  4  illustrates  the  “Material”  tab.  Two  elements  of  the  design  are  specified  through  this  screen  - 
the  items  that  comprise  the  product  (and  their  quantity)  and  the  type  of  material  for  each  item.  In  all  cases 
throughout  this  example,  white  boxes  on  the  screen  denote  areas  where  the  user  can  enter  or  change  a  value, 
such  as  the  specification  of  material  Titanium  6A1-4V3  in  Figure  4.  Grayed-out  panels  in  the  screen  display 
information  that  cannot  be  changed  through  the  particular  tab  that  is  being  displayed,  such  as  the  price  of  the 
material  in  Figure  4.  In  this  case,  price  is  not  a  variable  that  the  designer  can  change  -  it  is  done  separately 
through  the  administration  view  described  later.  However,  the  process  specification,  even  though  it  is  grayed 
out  in  Figure  4,  can  be  changed  by  the  designer,  but  through  the  Process  tab. 

The  item  list  for  the  product  that  is  being  designed  is  controlled  through  the  Material  tab  screen.  In  the 
example  in  Figure  4,  the  product  is  composed  of  four  items  -  “skin,”  “stringer,”  “Frame,  Zee,”  and  “Frame, 
Angle.”  Since  “Frame,  Angle”  is  highlighted  in  the  list  in  Figure  4,  all  of  the  displays  will  relate  to  this  item. 
The  number  of  items  that  are  used  in  the  product  is  specified  to  the  right  of  the  item  name;  in  this  case,  there  is 
one  “Frame,  Angle”  used  in  the  design.  Also  note  that  the  material  that  is  specified  in  Figure  4  is  actually  for 
the  item  “Frame,  Angle.”  Items  are  easily  added  to  and  deleted  from  the  items  list  through  a  button  control  that 
is  a  part  of  the  general  toolbar.  (The  toolbar  controls  are  not  shown  in  the  screen  shots;  they  a  similar  to 
standard  toolbars  found  in  all  Windows  applications.) 

The  graphs  in  the  right  panel  of  Figure  4  display  the  relationship  between  recurring  and  non-recurring 
labor  hours  and  the  primary  cost  driver,  in  this  case  the  length  of  the  item.  The  dot  on  the  line  graph  indicates 
where  this  item’s  cost  driver  falls  in  the  relevant  range  of  the  relationship;  in  this  case  the  item’s  length  is  three 
feet  and  at  the  lower  end  of  the  estimating  range.  The  item’s  length  is  specified  through  the  Form  tab  that  is 
described  below.  The  cost  estimating  relationships  that  are  plotted  in  the  graphs  in  Figure  4  are  obtained  from 
the  Manufacturing  Cost/Design  Guide  (MC/DG)  [6,  7].  In  this  case,  CED-T-1B  refers  to  the  specific  cost 
estimating  data  for  producing  a  cylindrically-contoured  angle  member  from  titanium  using  the  Brake/Hot 
Stretch  process. 
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Figure  4  -  Basic  user  view  for  specifying  the  items  that  comprise  the  product  and  their  materials. 
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Figure  5  -  Basic  user  view  for  specifying  the  form  of  the  product. 
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Figure  5  provides  an  example  “Form”  tab.  In  this  case,  the  “Frame,  Angle”  item  has  an  angle  cross- 
section  and  is  cylindrically  contoured,  as  show  by  the  selected  (highlighted)  form  property  in  Figure  5.  In 
addition  to  specifying  shape,  the  Form  tab  is  used  to  enter  an  item’s  basic  dimensional  properties,  such  as 
length,  width,  and  thickness.  The  next  version  of  OCEANS  will  include  a  hierarchy  of  forms.  The  use  of  an 
object-based  hierarchical  approach  will  permit  each  form  to  have  its  own  set  of  properties,  as  well  as  inherit 
properties  from  its  “parent”  in  the  hierarchy.  It  is  anticipated  that  group  technology  will  facilitate  the 
classification  of  candidate  forms. 

The  Process  tab  shown  in  Figure  6  is  used  to  specify  the  primary  manufacturing  process  for  producing 
an  item,  in  this  case,  “Brake/Hot  Stretch.”  The  OCEANS  system’s  process-related  properties  include  labor  rates 
and  learning  curve  factors.  The  next  version  of  OCEANS  will  include  a  hierarchy  of  processes  and  the  list  of 
candidate  processes  will  include  only  those  that  are  compatible  with  the  specified  form  and  process  properties. 

Implementation  of  the  assembly  process,  under  the  OCEANS  system’s  current  database  structure, 
presented  some  development  complications.  The  problem  appears  to  be  solved  but  since  the  approach  has  not 
been  fully  tested,  it  is  not  included  in  this  version.  This  may  be  a  moot  point  because  the  utilization  of  a  more 
object-oriented  database  structure,  e.g.  through  OZ  or  the  Knowledge  Center,  should  eliminate  the  problem. 
However,  the  screen  layout  for  the  Assembly  tab  is  provided  in  Figure  7  in  order  to  illustrate  its  intended 
operation.  All  of  the  items  included  in  a  design  would  appear  in  the  lower  right  window,  thereby  providing  a 
pick  list  of  items  to  be  assembled.  Selected  items  are  moved  to  the  window  in  the  lower  left  potion  of  the 
screen.  The  method  for  fastening  the  items,  along  with  other  requirements  and  parameters,  are  specified  in  the 
boxes  in  the  upper  left  portion  of  the  screen. 

Cost  Reports 

Figure  8  provides  an  example  of  one  of  the  two  types  of  cost  reports  that  are  available  from  the 
OCEANS  system.  This  summary  view  reports  only  the  top-level  costs  —  total  recurring  cost,  total  non-recurring 
cost,  and  average  cost  per  shipset  —  for  the  entire  product  that  is  being  designed.  In  this  case,  the  costs  are  for 
the  entire  fuselage  panel  that  was  specified  and  described  above;  it  is  named  “Sample  1.”  This  view  displays 
aggregate  costs  for  each  design  alternative  that  is  being  considered  (i.e.,  each  alternative  that  is  open  on  the 
desktop)  on  a  separate  line.  The  view  window  is  also  referred  to  as  the  Project  Desktop  since  it  provides  access 
to  project-level  variables,  such  as  design  quantity  and  the  constant-year  dollars  for  the  project.  A  toggle  control 
button  on  the  main  toolbar  controls  whether  the  detailed  cost  report  is  displayed  or  hidden. 

A  detailed  cost  report  is  prepared  for  each  alternative  design;  i.e.  a  cost  report  is  “attached”  to  each 
user  view  that  was  described  above.  A  toggle  control  button  on  the  main  toolbar  controls  whether  the  detailed 
cost  report  is  displayed  or  hidden.  A  portion  of  the  detailed  cost  report  is  shown  in  Figure  9.  Detailed  cost 
information  is  provided  for  each  item  in  the  design  and  includes  not  only  a  further  breakdown  of  the  cost 
elements  (direct  labor,  material,  etc.)  but  the  key  cost  factors  that  are  used  in  the  cost  calculations  (learning 
curve  factor,  labor  rates,  burden  rates,  etc.). 
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Figure  6  -  Basic  user  view  for  specifying  the  primary  manufacturing  process. 
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Figure  7 


-  Basic  user  view  for  specifying  the  assembly  of  items. 
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Figure  9  -  Detailed  cost  report  for  a  specific  alternative. 


10-14 


Both  cost  reports  -  summary  and  detailed  -  are  updated  as  soon  as  a  change  is  made  in  the  design 
specification,  thereby  providing  immediate  feedback  to  the  designer  or  IPT  on  the  affect  of  the  change. 

Administration  "view"  of  the  OCEANS  system 

The  second  view  offered  by  the  OCEANS  system  is  referred  to  as  the  administration  view.  It  provides 
the  capability  to  maintain  the  information  in  the  system  —  update  existing  information  (e.g.  material  prices, 
labor  rates,  and  burden  rates)  and  enter  new  information  (e.g.  new  forms,  materials,  processes,  and  estimating 
relationships)  into  the  system.  The  administration  part  of  OCEANS  is  used  to  maintain  information  that  is  not 
under  the  control  of  the  user  (designer,  IPT)  and  is  usually  common  across  all  projects.  It  is  maintained 
centrally;  whereas,  the  user  portion  is  controlled  and  maintained  locally.  Recall  the  user  can  easily  create  new 
projects  and  alternative  designs  within  projects. 

There  are  actually  two  modes  -  browse  and  change  —  for  the  administration  portion  of  OCEANS. 
While  both  modes  have  the  same  interface,  the  browse  mode  allows  users  to  only  view  the  information  from  the 
administration  perspective;  whereas,  the  change  mode  allows  users  to  modify  the  information.  Typically,  the 
primary  user  of  the  OCEANS  system,  designers  and  IPTs,  would  not  have  authority  to  utilize  the  change  mode 
of  the  administration  view.  However,  it  may  be  useful  for  the  user  to  browse  or  query  the  basic  information 
included  in  the  system  through  the  administration  view.  Access  to  the  information  in  the  different 
administration  view  modes  is  controlled  through  passwords  or  other  security  measures. 

Screen  shots  are  again  used  in  this  section  to  illustrate  the  interface  and  describe  the  information 
administration  capabilities  of  the  OCEANS  system.  A  “tabbed”  interface  is  employed,  similar  to  those 
described  earlier  in  the  user  view  section.  The  screen  shots  reflect  the  change-mode  interface  that  allows 
information  in  the  system  to  be  modified,  added,  or  deleted. 

Figure  10  shows  the  administrative  view  for  information  related  to  materials.  All  of  the  materials 
contained  in  the  system  are  listed  in  the  window  in  the  right-hand  portion  of  the  screen.  When  a  material  is 
selected  (highlighted)  in  the  window,  its  property  values  are  displayed  in  the  left-hand  portion  of  the  screen.  If 
the  user  has  proper  authority,  they  may  modify  these  values  (price,  units,  description,  etc.).  In  browse-only 
mode,  these  property  values  would  be  displayed  with  a  gray  rather  than  white  background,  indicating  that  they 
can  not  be  changed.  The  OCEANS  database  is  updated  once  one  of  the  modify,  insert,  or  remove  buttons  are 
selected.  In  the  browse-only  option,  the  modify/insert/remove  buttons  are  “grayed-out”  and  hence  inactive.  As 
mentioned  earlier,  the  next  version  of  OCEANS  will  contain  a  more  extensive,  and  dynamic,  list  of  properties. 
This  change  will  convert  the  materials  list  to  a  materials  hierarchy  and  the  properties  that  are  displayed  will 
depend  on  the  materials  class  that  is  selected.  For  example,  a  heat  treat  property  will  only  be  present  on  metallic 
materials  and  not  present  on  composites.  The  OCEANS  system  will  also  have  the  ability  to  connect  to  other 
databases  in  order  to  directly  provide  property  values.  For  example,  material  prices  may  be  maintained  in 
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Figure  10  -  Administration  view  for  maintaining  information  on  materials 


Figure  1 1  -  Administration  view  for  maintaining  information  on  processes. 
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another  database;  therefore,  the  value  displayed  will  not  be  directly  editable  in  OCEANS.  In  this  case,  the 
property  value  for  material  price  will  actually  be  a  pointer  or  path  designation  to  an  element  in  another 
database. 

Figure  1 1  shows  the  administration  view  for  manufacturing  processes.  Its  interface  and  operation  is 
similar  to  that  for  the  Materials  tab.  The  only  process-related  properties  that  are  contained  in  this  version  of 
OCEANS  are  recurring  and  non-recurring  labor  rates  and  a  learning  curve  factor.  The  next  version  will  extend 
this  set  of  properties  considerably. 

Figures  12  and  13  show  the  administrative  views  that  provide  the  means  for  defining  cost  estimating 
relationships  and  assigning  them  to  specific  form-material-process  combinations.  As  discussed  earlier,  the  cost 
estimating  relationships  are  those  provided  in  the  MC/DG  ([6,  7]).  The  MC/DG  reports  contain  only  a  graph  of 
each  relationship  -  the  underlying  data  or  equations  for  the  relationships  are  not  available.  For  the  purpose  of 
this  prototype,  piecewise-linear  approximations  of  the  relationships  are  used.  Figure  12  shows  the  interface  for 
the  CED  tab.  In  this  example,  the  relationship  labeled  CED-A-1A  is  specified.  (The  CED-A-1A  label  follows 
the  naming  convention  used  in  the  MC/DG.)  The  X,Y  values  that  are  used  to  define  the  piecewise-linear 
relationship  is  entered  in  the  CED  Data  Set  section  of  the  screen.  A  graph  of  the  relationship  is  automatically 
displayed  in  the  “CED  Preview”  window  included  in  the  lower  right-hand  portion  of  the  screen.  This  estimating 
relationship  is  linked  or  assigned  to  a  specific  form-material-process  combination  through  the  “CED  Index”  tab. 

The  “CED  Index”  tab  is  shown  in  Figure  13.  In  this  example,  any  item  that  is  made  of  “Aluminum,”  is 
of  the  form  “Angle,  Cylindrically  Contoured,”  and  is  produced  using  a  “Brake  Form”  process  uses  the 
estimating  relationship  CED-A-1A  for  recurring  costs  and  CED- A- IB  for  non-recurring  costs.  If  a  form- 
material-process  combination  has  not  been  assigned,  the  system  will  warn  the  user  of  that  fact  and  will  use  the 
first  compatible  process  for  the  specified  form-material  combination,  if  one  exists.  Incompatibility  of  specific 
form-material-process  combinations  may  be  defined  through  this  window  by  selecting  the  appropriate  values 
for  form,  material,  and  process  in  the  top  part  of  the  screen  in  Figure  13  and  then  checking  the  “Incompatible” 
box  in  the  lower  left-hand  portion  of  the  screen. 

A  later  release  of  the  OCEANS  system  will  contain  a  more  flexible  cost  module  that  will  allow  a 
variety  of  types  of  cost  relationships  obtained  from  different  sources  to  be  defined  and  selected  by  the  user. 

Future  Research  and  Development  Directions 

The  prototype  that  is  described  in  this  report  was  demonstrated  to  the  Lab  focal  point  for  this  project, 
representatives  from  a  major  aircraft  manufacturer,  and  two  software  development  companies.  Everyone  was 
very  supportive  of  the  work  to  date  and  encouraged  future  development.  They  all  felt  that  it  is  critical  to  have 
tools/environments  like  the  OCEANS  system  available  early  in  the  design  process  in  order  to  develop 
affordable  products.  All  of  the  reviewers  had  very  helpful  comments  and  suggestions,  many  of  which  the  author 
was  aware  of  but  inclusion  of  those  capabilities  in  the  prototype  was  beyond  the  scope  of  the  project.  Many  of 
these  ideas  are  presented  below.  They  set  an  agenda  for  future  development. 
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Figure  12  -  Administration  view  for  defining  cost  estimating  relationships. 


Figure  13  -  Administration  view  for  assigning  cost  estimating  relationships. 
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As  indicated  earlier,  while  the  OCEANS  prototype  is  a  fully-functional  system,  it  has  limited 
capability,  is  based  on  a  limited  application  domain,  and  includes  a  portion  of  the  data  and  expertise 
from  a  single  source,  the  Manufacturing  Cost/Design  Guide  ( MC/DG )  ([6, 7]).  However,  these  limitations 
are  within  the  defined  scope  of  the  project.  While  knowingly  limited  in  terms  of  its  breadth  and 
depth,  this  prototype  demonstrates  a  proof  of  concept  for  an  object-based  cost  assessment 
environment  that  supports  the  design  process.  As  such,  it  provides  a  very  important  foundation  for 
future  developments.  For  example,  it  is  a  vehicle  for  generating  dialogue  with  industry  in  order  to 
further  define  the  capabilities  that  are  required  in  order  to  effectively  support  the  design/ IPPD 
processes.  Throughout  this  report,  references  have  been  made  to  how  the  OCEANS  system  will  be 
improved  in  subsequent  versions.  Those  enhancements  are  briefly  summarized  below.  This  is 
followed  by  a  preliminary  description  of  a  more  complete  OCEANS  application  environment  and 
how  it  might  be  utilized  in  the  design  process.  The  report  concludes  with  a  brief  discussion  of  the  next 
logical  step  in  the  development  process. 

The  development  of  this  prototype  has  demonstrated  that  the  use  of  an  object-based  approach  is  not 
only  viable  but  an  extremely  important  technology  on  which  to  build  a  system  that  will  effectively  provide  an 
integrated  cost  assessment  capability  during  design.  However,  the  OCEANS  system  must  be  more  object-based 
than  its  current  implementation.  Some  of  the  major  capabilities  that  are  needed  are:  more  object-oriented  data 
structures,  hierarchies  with  property  inheritance,  knowledge  storage  and  processing,  flexible  and  open  cost 
models,  and  links  to  other  platforms,  data,  and  programs.  As  mentioned  earlier,  these  most  likely  will  be 
achieved  by  utilizing  such  software  as  OZ  and  Knowledge  Center.  The  system  also  needs  broader  and  deeper 
information  on  product  forms,  materials,  and  manufacturing  processes.  The  OCEANS  system  needs  to 
incorporate  a  means  to  better  link  cost  elements  to  design  parameters  and  be  able  to  better  handle  product 
assembly  and  integration  issues.  Many  of  these  concerns  can  be  resolved  through  interactions  with  industry.  In 
fact,  many  aspects  of  the  OCEANS  system  will  be  greatly  enhanced  by  applying  the  system  in  a  real-world 
setting  and  implementing  the  information  and  suggestions  that  result  from  these  endeavors  into  future  versions. 
Demonstrating  and  testing  the  OCEANS  system  in  a  real-world  environment  provides  a  rich  opportunity  to 
capture  and  represent  information  on  real  products,  processes,  and  cost  models,  as  well  as  expertise  from 
designers  and  IPTs. 

The  OCEANS  prototype  is  a  first  step  towards  a  comprehensive  software  system  and  application 
environment.  A  vision  for  how  such  a  system  would  effectively  integrate  cost  assessment  into  the  design 
process  is  illustrated  in  Figure  14.  The  system  would  provide  a  strong  link  between  members  of  an  IPT  and  the 
product/process  design  environment.  It  would  also  be  considered  an  evolving  application,  in  that  its  capabilities 
and  resources  would  continually  be  updated  and  enhanced  based  on  lessons  learned  by  the  IPTs  and  through 
research  projects.  The  system  would  itself  be  a  valuable  research  tool  and  would  provide  a  powerful  learning 
environment  for  academe  and  industry. 
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Figure  14:  The  envisioned  application  environment  for  the  OCEANS  system. 


The  primary  components  of  the  system  are  shown  in  the  shaded  OCEANS  box  in  Figure  14.  The  F-M- 
P  (Form-Material-Process)  database  contains  instances  of  the  object  classes  and  is  populated  with  a  variety  of 
design  alternatives  so  that  new  design  considerations  are  able  to  draw  upon  and  effectively  use  this  history.  The 
knowledge  base  contains  rules,  element  descriptions,  notes  and  annotations  from  past  use,  etc.  that  further 
define  and  support  the  development  of  relationships  among  the  objects  that  are  used  to  assess  product/process 
cost.  The  system  is  composed  of  numerous  methods,  models,  tools,  etc.  that  define  the  behavior  of  objects  and 
provide  the  computational  capability  of  the  system.  Providing  effective  feedback  to  designers  and  IPTs,  in 
forms  that  are  meaningful  and  useful  to  them,  is  critical  to  any  system’s  success.  Therefore,  “interfaces”  is 
defined  as  a  primary  element  of  the  system.  Interfaces  include  the  means  for  all  users  of  the  system  to 
effectively  and  efficiently  provide  input  to  and  receive  output  from  OCEANS.  Another  important  capability  of 
OCEANS  is  the  generation  of  standard  and  ad  hoc  reports  to  document  and  support  the  design  process. 

The  final  major  component  of  the  OCEANS  software  system  is  its  underlying  structure.  An  objected- 
based  framework  provides  the  needed  flexibility  that  allows  different  users  to  represent,  manage,  and  query  the 
data,  construct  and  use  objects,  specify  and  use  methods,  define  output,  etc.  to  best  meet  their  needs.  That  is,  the 
system  should  be  capable  of  being  easily  redefined  or  reconfigured  in  order  to  best  represent  a  specific  product, 
project  or  company  organization,  level  of  expertise,  design  requirements  and  considerations,  etc.  Similarly,  the 
object-based  structure  must  accommodate  increasing  levels  of  detail  as  a  design  evolves. 
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In  order  to  effectively  support  the  design  and  IPPD  processes,  it  is  imperative  that  the  system 
facilitates  and  encourages  communication  and  collaboration  among  member  of  the  IPT.  It  must  also  easily  and 
seamlessly  link  to  and  integrate  with  methods,  models,  and  tools  from  other  applications. 

As  mentioned  earlier,  the  most  effective  way,  at  this  point  in  the  development  process,  to  expand  the 
capability  of  the  OCEANS  prototype  is  through  interaction  with  industry.  That  is,  to  apply  the  prototype  system 
to  actual  case  examples  and  represent  specific  problem  domains,  products,  and  processes  in  the  system  s  object- 
based  context.  This  is  best  accomplished  through  collaboration  between  the  developers  of  OCEANS  and 
industry,  including  product  developers  and  manufacturers,  as  well  as  software  developers.  The  industry 
interactions  not  only  provide  data  and  expertise  on  the  products  being  designed  and  the  processes  that  are  used 
to  produce  the  products,  but  provide  a  richer  understanding  and  appreciation  of  the  design  and  IPPD  processes. 
This  understanding,  along  with  feedback  from  industry  as  the  OCEANS  system  evolves  will  result  in  an 
application  that  better  meets  the  commercial  and  military  needs  to  develop  affordable  products. 

Therefore,  a  critical  next  step  in  the  evolution  of  the  OCEANS  system  is  to  involve  industry  partners 
in  order  to  provide  the  expertise  and  feedback  that  are  necessary  in  order  to  further  refine  OCEANS  conceptual 
approaches,  populate  its  object  base,  and  enhance  its  overall  functionality.  As  illustrated  in  Figure  15,  industry 
involvement  in  this  endeavor  is  currently  being  pursued  along  two  fronts:  (1)  product/process  design  and 
costing  and  (2)  software  application  design  and  development.  The  most  likely  industry  partner  for  the  first  front 
is  Northrup-Grumman’s  Commercial  Aircraft  Division  (NG-CAD).  This  is  based  on  the  author  s  involvement 
with  the  company  through  the  C-17  Manufacturing  2005  Program  ([8,  9]).  Other  potential  industry  partners  that 
have  been  identified  include  The  Aerostructures  Corporation,  an  aircraft  component  manufacturer,  and  SBIRE, 
a  virtual  manufacturing  enterprise.  Two  likely  partners  from  the  software  industry  are,  as  previously  discussed, 
Cognition  Corporation  and  Knowledge  Base  Engineering  (KBE). 


product/process  design  and  costing 

Northrup-Grumman't  Commercial  Aircraft  DivWon 


Figure  15:  Industrial  partners’  involvement  with  future  OCEANS  systems  development. 
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Abstract 


The  elastic  misfit  stresses  in  TiAl  alloys  assume  huge  level  of  the  order  of 
the  product  of  the  elastic  moduli  times  misfit  deformations.  The  elastic  energy 
associated  with  these  stresses  is  enormously  large  also  and  it  tries  to  relax  via 
one  of  several  possible  mechanisms.  Hence,  there  appears  a  driving  force  for  the 
rearrangement  of  the  material  elements  leading  to  the  evolution  accompanied  by 
the  creation  of  dislocations,  migration  of  vacancies,  development  of  interfacial 
corrugations,  etc... 

The  goal  of  this  research  project  is  two-fold:  i)to  develop  a  theoretical  ap¬ 
proach  which  permits  a  self-consistence  calculations  of  misfit  stresses  in  TiAl 
lamellar  multilayers  and  like  structures,  and  ii)to  develop  a  thermodynamic  ap¬ 
proach  allowing  one  to  study  the  possibility  of  stress  relaxation  by  means  of  mass 
rearrangement  in  the  vicinity  of  the  interface  between  the  lamellae.  We  assume 
that  the  rearrangement  is  accompanied  by  interfacial  diffusion  and/or  migration 
of  the  vacancies.  Since  the  full  consideration  of  the  coherent  interface  solid-solid 
includes  lots  of  calculation  and  details  we  illustrate'  our  results  and  approach  by 
considering  the  much  simpler  but  conceptually  a  very  close  problem  of  mass  re¬ 
arrangement  in  the  vicinity  of  a  stress-free  interface. 
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1.  ORGANIZATION  OF  THE  REPORT 


This  report  is  organized  as  follows.  First,  we  give  an  information  concerning  actual 
and  forthcoming  publications  and  of  work  in  progress  supported  by  this  Grant.  Then, 
we  present  our  main  results  in  3  concenters  which  can  be  read  independently  although 
all  of  them  deal  with  the  same  object:  a  solid  substance  with  interfaces.  The  separa¬ 
tion  in  the  3  concenters  is  intended  to  give  access  to  the  problem  at  hand  to  materials 
scientists  with  different  background.  First,  we  present  our  results  relating  to  misfit 
strains  and  stresses  in  Ti-Al  lamellar  alloys  in  the  form  mostly  convenient  for  ma¬ 
terials  scientists:  both  theorists  and  experimenters.  In  this  section  of  the  report  we 
follow  our  forthcoming  paper  ’’Internal  Stresses  in  Lamellar  Ti-Al”,  prepared  in  tight 
collaboration  with  physicists  and  materials  scientists  working  at  WPL.  We  skip  any 
lengthy  calculations  and  technical  details  and  concentrate  basically  on  the  concepts 
and  formules  which  seem  mostly  useful  for  interested  practitioners  with  background 
in  materials  science  and  mechanical  engineering.  In  order  to  establish  the  simplest 
transparent  formules  we  do  not  dwell  here  on  any  general  thermodynamic  paradigms, 
schemes  and  secondary  details.  Also,  we  give  preference  to  the  simplest  and  the  most 
robust  models  of  isotropic  lamellae  and  of  misfit  deformations.  We  ignore  even  the 
role  of  temperature  and  various  defects  but  the  most  essntial  lamellar  interfaces.  All 
these  simplifications  allow  us  to  establish  quite  simple  robust  formules  and  qualitative 
conclusions  allowing  numerous  generalisations.  Then,  we  proceed  with  presentation 
of  a  much  more  general  and  fundamental  theoretical  consideration  which  was  used  to 
establish  the  basic  formules  of  the  previous  section  and  which,  hopefully,  will  become 
useful  for  future  applications  in  various  topics  of  materials  science  and  engineering 
dealing  with  lamellar  microstructure.  In  the  last  concenter  we  present  our  results 
relating  to  evolution  of  lamellar  microstructure:  the  presentation  is  based  on  the 
recently  published  paper  ’’Role  of  Vacancies  in  Slow  Evolution  of  a  Traction-FVee  In¬ 
terface”.  In  order  to  keep  this  report  reasonably  concise,  we  consider  the  problem  of 
evolution  of  lamellar  microstructure  dealing  with  the  mechanism  based  on  evolution  of 
vacancies.  Even  this  simplest  mechanism  allows  to  explain  nucleation  of  dislocations, 
destruction  of  of  alloys,  and  many  other  phenomena.  Certainly,  these  phenomena 
of  paramount  interest  in  different  branches  of  materials  science  and  engineering  may 
have  several  other  explanations.  We  believe  though  that  many  of  other  mechanisms 
share  essential  features  of  the  mechanism  considered  in  this  report:  it  is  the  relax¬ 
ation  of  overall  stresses  based  on  the  development  of  interfacial  corrugations  which 
are  known  as  the  Stress  Driven  Rearrangement  Instability. 

2.  PUBLICATIONS  RESULTING  FROM  THE  GRANT 

M.A.  Grinfeld,  P.H.  Hazzledine,  Role  of  Vacancies  in  Slow  Evolution  of  a  Traction- 
Free  Interface.  Europhys.  Lettr.,  1997,  37,  pp.  409-413. 


11-5 


M.A.  Grinfeld,  P.H.  Hazzledine,  B.Shoykhet,  D.  M.  Dimiduk,  Internal  Stresses  in 
Lamellar  Ti-Al,  Metallurgical  and  Materials  Transactions  (accepted). 

M.A.  Grinfeld,  Stress  Corrosion  Cracking  in  Aggressive  Environment,  Acta  Met- 
allurgica  et  Materialia  (accepted). 

B.  Shoykhet,  M.A.  Grinfeld,  P.M.  Hazzledine,  Internal  Stresses  and  Strains  in 
Coherent  Multilayers,  Acta  Metallurgica  et  Materialia  (accepted). 

P.M.  Hazzledine,  B.  Shoykhet,  M.A.  Grinfeld,  Coherency  Strains  and  Stresses  in 
Lamellar  Ti-Al  (Proceedings  of  Tbusier  Conference  on  ’’Modern  Problems  of  Disloca¬ 
tions). 

M.A.  Grinfeld,  P.H.  Hazzledine,  B.  Shoykhet,  Geometrically  Nonlinear  Theory  of 
Lamellar  Microstructure  (in  preparation). 

B.  Shoykhet,  P.M.  Hazzledine,  M.A.  Grinfeld,  Homogenization  Methods  for  Lamel¬ 
lar  Microstructure  (in  preparation). 

3.  INTERNAL  STRESSES  IN  LAMELLAR  Ti-Al  (physical  dis¬ 
cussion) 

3.1.  Resume  of  the  Section 

In  this  Section  we  give  a  coherent  physical  presentation  of  the  more  general  results 
presented  in  the  next  Section.  We  specialize  the  more  general  formules  of  the  next 
Section  for  the  case  of  lamellar  structure  in  Ti-Al  alloys  which  were  the  main  concern 
of  our  study.  General  formulae  are  given  for  the  coherency  strains  and  stresses  in  a 
multilayer  far  from  the  free  surfaces  (a  detailed  derivation  of  the  strain  fields,  including 
edge  effects,  is  hard  and  it  is  planned  to  be  studied  in  near  future).  The  multilayer 
is  assumed  to  be  a  periodic  stack  of  different  elastically  isotropic  materials  but  there 
may  be  any  number  of  layers  in  the  stack  and  they  may  each  have  any  thickness 
and  any  elastic  constants.  The  results  are  applied  to  lamellar  Ti-Al  alloys  in  which 
there  are  shear  misfits  between  different  7  layers  and  both  shear  and  biaxial  misfits 
between  the  7  and  a2  layers.  In  a  fully  coherent  multilayer  the  stresses  would  be 
large,  in  the  GPa  range,  and  in  high  strength,  thin  lamella,  alloys  the  coherency 
stresses  are  a  substantial  fraction  of  a  GPa.  The  shear  stresses  act  principally  on  hard 
mode  deformation  systems  and  the  biaxial  stresses  place  every  a2  lamella  in  biaxial 
compression.  This  biaxial  compression,  which,  for  dislocation  glide,  is  equivalent  to 
a  uniaxial  tension  normal  to  the  lamella  is  particularly  large  when  the  a2  volume 
fraction  is  small. 

3.2.  Physical  motivation  and  qualitative  estimates 

When  a  thin  film  is  deposited  epitaxially  on  a  single  crystal  substrate  its  lattice  pa¬ 
rameter  initially  matches  that  of  the  substrate.  Consequently  the  film  acquires  an 
elastic  strain,  which  may  be  large,  while  the  substrate  remains  almost  unstrained. 
The  lattice  parameter  mismatch  which  causes  the  strain  generally  varies  with  tem- 
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perature  because  expansion  coefficients  vary  from  material  to  material.  As  the  film 
thickens,  the  elastic  strain  remains  constant  until  the  energy  of  the  bimaterial  can  be 
reduced  by  the  relaxation  of  the  elastic  strain  through  a  modification  of  the  interfaces. 
Substantial  elastic  strains  persist  in  films  which  are  thicker  than  this  coherence  limit 
[9].  In  a  multilayer  the  behavior  is  similar  except  that  the  strains  are  shared  between 
the  different  materials.  Other  things  being  equal,  the  thinner  layers  and  those  with 
lower  elastic  moduli  suffer  more  strain.  Most  calculations  of  elastic  strains  in  mul¬ 
tilayers  have  focussed  on  semiconductor  multilayers  in  which  the  crystal  structures 
of  the  different  layers  are  all  the  same.  Consequently  the  stresses  are  equi-biaxial 
tensions  and  compressions  while  the  strains  are  triaxial  because  of  the  Poisson  effect. 
Multilayers  with  graded  interfaces  may  be  handled  by  subdividing  the  multilayer  into 
infinitessimally  thick  layers  [10].  Some  multilayers  for  structural  applications  are  also 
composed  of  materials  with  the  same  crystal  structures  and  their  biaxial  tensions  and 
compressions  are  important  in  influencing  their  fracture  behavior  [1].  In  other  struc¬ 
tural  materials,  however,  the  crystal  structures  do  not  match  and  even  the  symmetries 
in  the  layer  plane  are  different;  a  case  in  point  is  Ti-Al.  In  lamellar  Ti-Al  alloys  a 
hexagonal  a i  phase  and  a  tetragonal  g  phase  are  joined  on  their  close  packed  planes. 
The  £*2  lattice  parameter  is  larger  than  the  lattice  repeat  of  7  so  biaxial  tensions  and 
compressions,  similar  to  those  in  semiconductors  are  necessary.  In  addition,  the  02 
structure  has  six-fold  symmetry  while  the  7  structure  lacks  even  three-  fold  symmetry 
so  shears  are  necessary  to  achieve  coherency.  The  Ti-Al  alloy  multilayer  has  a  wide 
spread  of  layer  thicknesses.  In  order  to  calculate  the  coherency  stresses  a  theory  is 
required  which  can  handle  large  numbers  of  layers,  of  widely  differing  thicknesses, 
different  sets  of  elastic  constants  and  different  lattice  repeats.  A  first  estimate  of 
the  stresses  [13]  showed  that  in  polysynthetically  twinned  (PST)  material  with  1mm 
thick  layers,  stresses  of  the  order  of  50MPa  would  be  expected  and  these  have  been 
detected  experimentally  [13],  [2],  In  PST  the  coherency  stresses  are  a  small  fraction 
of  the  yield  stress  .  The  motive  for  this  much  more  elaborate  calculation  is  that 
the  new  high  strength  Ti-Al  alloys  have  much  finer  lamellae  and  consequently  much 
larger  coherency  stresses.  In  a  partially  coherent  multilayer  like  Ti-Al,  with  layer 
thickness//,  the  coherency  stresses  fall  as  i/_1  while  the  yield  stress  falls  as  i/-1/2 
so  the  coherency  stresses  become  increasingly  significant  as  the  layers  thin.  When  H 
is  about  lOOnm,  the  calculations  in  this  paper  show  that  the  largest  coherency  stress 
is  as  large  as  the  yield  stress  and  is  therefore  capable  of  markedly  influencing  the 
mechanical  properties. 

3.3.  Stresses  and  Strains  in  Coherent  Multilayers  in  Ti-Al-alloys 

Consider  a  multilayer  consisting  of  a  stack  of  flat  layers  of  different  materials,  with  uni¬ 
form  thicknesses  and  planar  dimensions  very  much  larger  than  their  thicknesses  (see, 
Figure  1).  Each  layer  may  have  different  elastic  constants,  assumed  to  be  isotropic. 
The  layers,  which  are  parallel  to  the  x3  plane,  have  individual  thicknesses  Hk  and  are 
arranged  in  a  periodic  stack  with  N  members. N  may  take  any  value  and  the  total 
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thickness  of  the  periodic  stack  is  H .  The  fact  that  the  individual  layers  are  different 
materials  implies  that  their  lattice  constants  would  not  match  if  they  were  not  joined. 
When  they  are  joined  in  a  fully  coherent  multilayer,  all  the  lattice  parameters  in 
the  plane  of  the  layers  (but  not  perpendicular  to  it)  must  be  the  same,  consequently 
elastic  strains  and  stresses  exist  in  every  layer.  The  fact  that  there  is  no  requirement 
for  matching  along  x3  (the  layers  can  expand  or  contract  along  x3  freely)  indicates 
that  this  is  a  plane  stress  problem,  P 33  =  0.  Clearly  the  magnitude  of  the  elastic 
strains  depends  on  the  initial  misfit,  A,  defined  as  the  strain  required  to  transform 
some  reference  cell  into  the  cell  of  the  particular  free  layer.  In  the  coherent  state,  each 
layer  may  be  thought  to  contain  two  strains,  its  A  and  its  elastic  strain  K.  The  sum 
of  these  two  (A  +  K)  must  be  the  same  in  every  layer  at  coherency.  The  aim  is  to 
calculate  the  elastic  strain  in  each  of  the  N  layers  in  the  period  of  the  multilayer.  In 
principle,  N  may  be  very  large  but  a  physical  argument  shows  that  it  is  the  number 
of  crystallographic  variants  which  determines  the  size  of  the  calculation,  rather  than 
N.  In  a  Ti-Al  multilayer  containing  N  layers  in  a  period  there  are  only  four  distinct 
variants  (a  variant  has  a  unique  combination  of  A,  and  n  where  m  is  the  shear  mod¬ 
ulus  and  n  the  Poisson  ratio).  In  the  coherent  multilayer,  all  the  lattice  parameters 
match  so  each  layer  belonging  to  a  given  variant  must  have  the  same  strain  K  and 
therefore  the  same  stress  P .  It  makes  no  difference  to  the  argument  where  the  individ¬ 
ual  members  of  a  variant  are  located  in  the  multilayer  so  they  may,  for  convenience, 
be  collected  into  a  single  ’ply’.  The  problem  of  calculating  the  strains  and  stresses 
in  a  Ti-Al  multilayer  containing  thousands  of  layers  is  then  reduced  to  the  problem 
of  calculating  the  strains  and  stresses  in  a  four-ply  multilayer.  It  should  be  noted, 
however,  that  this  is  a  conceptual  process  rather  than  a  real  one  because  individual 
layers  must  be  very  thin  to  be  coherent.  By  the  time  the  layers  have  been  collected 
into  a  ply,  its  thickness  may  be  far  above  the  limiting  thickness  for  coherency  so  a 
’conceptual’  four-ply  has  larger  coherency  strains  in  it  than  a  real  four-  ply.  At  the 
boundaries  of  the  multilayer,  the  strains  vary  over  a  distance  comparable  with  the 
thickness  of  a  layer  but  far  from  the  edges  they  are  uniform.  In  this  paper  we  examine 
the  uniform  strains  within  each  layer  of  the  multilayer,  strains  which  necessarily  have 
a  discontinuity  at  every  interface  to  ensure  coherency.  The  fields  obey  the  normal 
rules  of  strain  compatibility  and  stress  equilibrium  within  a  layer  and  they  provide 
mechanical  equilibrium  at  each  interface.  The  strains  scale  in  such  a  way  that  the 
volume  average  of  each  stress  component  is  zero,  as  it  must  be  in  a  solid  with  no 
external  loads.  The  zero  volume  average  stress  condition  brings  the  multilayer  to  a 
state  of  minimum  strain  energy. 

In  order  to  present  the  results  of  the  next  Section  in  the  most  transparent  form 
the  following  notions  and  notation  are  required.  We  denote  N  the  total  numer  of  the 
layers  in  the  stack  with  individual  thickness  Hk,  shear  modulus  fik  ,  Poisson  ratio  i /k, 
and  biaxial  modulus  Mk  =  2/ik(l  +  i/k)/(l  —  vk).  Two  additional  average  parameters 
of  the  stack  are  required.  The  first  is  an  integrated  shear  modulus  S  =  ^2Hk/ik  and 
the  second  is  an  integrated  biaxial  modulus  B  =  HkMk  .  The  symmetric  strain 
tensor  in  layer  L  is  then  given  as  a  function  of  material  parameters,  S,  B  and 


transformation  strains  A  as  follows: 


KLj)  =  -k  5Z  (24r«iw  -  +  $™»nn*)  (a^  -  A^)  H(3.1) 

M=L 

±  £  HmMm  (s^  -  LtZks^mn^  (a^  -  a£">) 

In  terms  of  the  same  parameters,  the  stress  tensor  P^f  in  layer  L  is 


(3.2) 

Ff  =  ( S§  £  (2 6“6i  -  F*6,J)  +  H  £  (a£w)  -  A^) 

i  M=L  M—L  J 

In  both  eqs.  3.1  and  3.2,  roman  indices  run  from  1  to  3  while  greek  indices  run  from 
I  to  2.  t i{  =  5?  and  d  is  a  Kronecker  delta.  The  meaning  of  A^1^  —  A^^  is  the 
strain  required  to  transform  the  cell  of  layer  L  to  the  cell  of  layer  M  in  the  free  state. 


3.4.  Geometry  of  lamellar  Ti-Al 

The  form  of  the  multilayers  in  Ti-Al  is  shown  in  Figure  2.  The  lamellar  structure 
is  derived  from  a  grain  of  »2  by  an  incomplete  transformation  from  the  hexagonal 
D019  phase  to  the  tetragonal  7  phase  with  the  Llo  structure.  The  (*2  lamellae 
are  the  remnants  of  the  original  02  grain  and  consequently  they  all  have  the  same 
crystallographic  orientation.  The  plane  of  the  lamellae  is  the  (0001)  basal  plane 
and  the  plane  of  the  7  lamellae  is  {111}.  Close  packed  directions  in  all  the  crystals 
are  approximately  aligned,  consequently  there  are  six  rotational  variants  of  the  g 
lamellae  ( labelled  70,  760— -7300,  the  rotation  is  anticlockwise  with  respect  to  variant 
0)  of  which  four  are  shown  in  figure  2.  Lamellar  Ti-Al  alloys  take  two  forms:  Either 
they  are  derived  from  a  single  Q2  grain  and  referred  to  as  polysynthetically  twinned 
(PST)  or  they  are  derived  from  polycrystalline  02  and  referred  to  as  fully  lamellar 
(FL).  In  both  cases,  within  one  lamellar  colony,  a  small  fraction  of  the  lamellae  are 
continuous  c*2  crystals  and  the  remainder  are  7  crystals  which  are  divided  into  domains 
(themselves  different  rotational  variants  of  7)  ,  see  Figure  2.  Typically,  the  grain  size 
of  FL  material  or  the  dimensions  of  a  PST  specimen  are  large,  of  order  mm  or  cm. 
The  domain  size  is  typically  50mm  and  the  lamellar  thicknesses  are  in  the  range  20  nm 
to  1mm.  In  all  cases,  therefore,  the  crystals  have  much  larger  dimensions  in  the  plane 
of  the  lamellae  than  normal  to  it.  The  L10  structure  is  most  conveniently  described 
as  face  centred  cubic  with  alternate  {002}  planes  occupied  by  Ti  and  A1  atoms.  If  the 
c/a  ratio  of  this  unit  cell  were  1.0,  all  of  the  7  variants  would  be  fully  coherent  with 
one  another  but  they  would  not  be  coherent  with  e*2  because  the  lattice  parameter  a 
of  »2  is  larger  than  ay/ 2  in  7.  Coherency  could  be  achieved  by  the  presence  of  biaxial 
tensions  in  g  and  biaxial  compressions  in  02.  In  fact,  c/a  of  7  is  about  1.02  and  this 
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implies  that  only  twinned  pairs  of  g  variants  match  on  the  {111}  planes  (70  matches 
with  718O)  760  matches  with  7240  etc.).  Shear  strains  are  required  in  the  7  lamellae 
to  make  them  fully  coherent.  In  a  7/0:2  material,  both  biaxial  tension/compression 
and  shears  are  needed.  In  the  theory  of  section  2,  the  coherency  strains  contain  A, 
the  strain  required  to  transform  the  dimensions  of  a  unit  cell  of  a  reference  lattice 
into  the  dimensions  of  the  unit  cell  of  a  free  lamella.  In  the  theory  only  the  in-plane 
dimensions  are  matched,  the  unit  cells  may  differ  in  length  along  the  x3  direction.  In 
principle,  any  unit  cell  could  be  used  as  the  reference  lattice  (because  all  the  formulae 
contain  differences  between  two  A  tensors)  but  here  a  choice  is  made  to  use  the  unit 
cell  of  the  70  variant  (or,  equally,  the  7180)  as  the  reference,  so  A0  =  A180  =  0.  The 
remaining  transformation  strains  are  defined  as  the  strains  required  to  transform  the 
70  cell  into  the  cell  in  question.  Setting  x1  parallel  to  <  110  >  in  the  70  variant,  the 
transformation  strains  are: 


A60  = 

A 120  = 


Aq2  = 


A240  =  | 


A300  =  ^ 


1  -l/v/3 

— l/v/3  -1 

1  l/v/3  ] 

l/v/3  -1 


•  0 
[  0  e  —  2e/3 


(3.3) 

(3.4) 

(3.5) 


where  e  =  (c/a  -  1)  in  the  7  phase  and  s'  =  (a„2  -  \/2a7)  /aa2  is  one  of  the  mis¬ 
matches  between  a2  and  7. 


4.  STRESSES  AND  STRAINS  IN  LAMELLAR  Ti-Al 

The  stresses  and  strains  in  any  layer  of  a  coherent  Ti-Al  multilayer  can  be  found  from 
equations  3.1  and  3.2  so  long  as  the  volume  fractions  of  the  four  components,  the 
elastic  constants  and  the  lattice  parameters  are  known.  In  this  section,  the  stresses 
and  strains  in  three  idealised  forms  of  Ti-Al  are  given  explicitly  to  act  as  a  guide  for 
the  discussion  in  section  5.  The  idealised  forms  are  (i)  Ti-Al  which  contains  only  the 
three  gamma  variants,  (ii)  Ti-Al  which  contains  alternating  layers  of  a2  and  a  single 
7  variant,  70  and  (iii)  Ti-Al  which  contains  70,  760  and  7120  in  equal  volume  fractions 
/  and  0-2  at  volume  fraction  (1  —  3/).  The  first  is  a  good -approximation  to  many  PST 
specimens  which  contain  a2  concentrations  of  about  lOhigh  strength  alloys  and  the 
third  is  a  realistic  representation  of  lamellar  Ti-Al  since  there  is  no  obvious  reason 
why  one  variant  of  g  should  be  more  common  than  another. 

All  7  alloys 

The  material  may  contain  all  six  variants  of  7  but  the  strains  are  the  same  in 
twinned  pairs  so  only  three  strain  fields  exist.  Since  isotropic  elasticity  is  assumed, 
the  elastic  constants  are  the  same  for  each  lamella.  This  is  a  plane  stress  problem, 
so  P33  =  0  but  inspection  of  the  transformation  strains,  eq.  ??  and  ??,  shows  that 
only  in-plane  shears  exist  therefore  KS3  =  0  also.  Substitution  of  the  transformation 
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strains  ,  ??  and  ??  into  3.1  give  the  following  strains,  K0  in  the  0°  and  180°  variant 
layers,  Kqq  in  the  60°  and  240°  variant  layers  and  K120  in  the  120°  and  300°  variant 
layers: 


-K0  =  h 0 
£ 


r  1  —  1A/3 1 

1  / 

1  l/\/3  ' 

2/3  O' 

H 

1 

H 

1 

+  /120 

1/VS  -1 

0  -2/3 

(4.1) 


-i^GO  =  fo 


-1  l/v/3  ' 

1  / 

0  2/V3  ' 

■  -1/3 

l/\/3  ' 

l/y/3  1 

+  /I20 

0 

% 

_ 1 

.  I/Vs 

1/3  . 

(4.2) 


-K120  =  fo 


-1  -l/y/3  ' 

r  / 

1 

es" 

1 

0 

-1/3 

-l/y/3  ' 

-l/y/3  1 

+/60 

-2/v/3  0 

-l/y/30 

1/3 

(4.3) 


The  strains  on  the  right  side  are  the  limiting  values  when  all  six  variants  have 
equal  volume  fractions,  f0  —  /so  =  /120  =  1/3. 

Since  K33  =  0  in  all  cases,  the  stress  fields  have  identical  forms: 

Pq  =  2fiKo  Peo  =  2/iKso  P120  =  2/i/Ci2o  (4.4) 


In  the  coherent  multilayer,  the  in-plane  lattice  parameters  all  match  each  other. 
Referring  to  Fig.  2,  the  triangles  have  sides  parallel  to  close  packed  directions  and 
the  lengths  of  these  sides  illustrate  the  effects  of  the  coherency  strains:  In  the  70 
variant,  viewed  down  (—1  —  1  —  1)  the  unique  side,  [—110],  has  a  free  length  ay/2 
and  the  two  other  sides  have  lengths  ay/2{  1  +e/2).  In  the  coherent  state,  the  [—110] 
side  assumes  a  length  of  a\/2[l  +  (/so  +  /i2o)e/2],  the  [-101]  side  has  length  ay/ 2 
[1  +  {fo  +  /i2o)e/2]  and  the  [0-11]  side  has  length  ay/ 2[1  +  (fo  +  /so)e/2].  If  fo  = 
feo  =  /120  =  1/3,  all  three  sides  have  the  same  length  ay/ 2(1  +  e/3).  Formally  the 
7  phase  loses  most  of  its  symmetry  in  the  coherent  multilayer  and  becomes  triclinic, 
only  regaining  trigonal  symmetry  when  the  volume  fractions  of  the  variants  are  equal. 
In  the  ’all  gamma’  approximation  to  lamellar  Ti-Al,  the  coherency  strains  and  stresses 
are  entirely  homogeneous  shears  in  the  plane  of  the  lamellae.  The  typical  magnitude  of 
the  shear  stress  is  2/xe/3  or  about  0.9GPa  in  a  fully  coherent  material  with  fx  —  70GPa 
and  e  =  0.02.  The  orientation  of  the  stress  is  such  that  it  exerts  no  force  on  soft  mode 
dislocations  and  can  therefore  not  influence  the  plastic  behavior  of  the  soft  mode. 
Hard  mode  deformation  systems  may  be  either  softened  or  hardened  by  the  coherency 
stresses. 

£*2-  single  7  variant  bilayered  material 

The  material  under  consideration  here  consists  of  alternate  layers  of  the  70  variant 
(Ai  =  0)  and  Q2  .  The  transformation  strain  for  Q2  given  in  eq.  ??  only  has  principal 
strains  A11  and  A22  because  X\  and  x?  are  principal  axes.  If  rotated  axes  had  been 
used  there  would  be  a  shear  strain  A12  and  this  is  included  for  completeness.  The 
transformation  strain  used  is: 


•  An  Ai2 

A21  A22 


(4.5) 
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Subscript  1  refers  to  material  1,  that  is  70,  and  subscript  2  refers  to  material  2, 
that  is  a2.  Writing  M  for  the  biaxial  modulus  2fi(l  +  v)/(l  —  v ),  the  strains  in  the 
two  materials  are: 


K?  =  f  [M2(A11+A22)/M*+M2(A11-A22)/M*] 

^[m2(£'-£/3)/M*+M2£/3M’] 

Kn  =  K 12  =  ^2^12 
1  1 

K?  =  y[M2(A11+A22)/M*-^2(A11-A22)/,x*] 
/  [m2  (£'  -  £/3)  /AT  -  /i2£/3/i'" 


(4.6) 


(4.7) 

(4.8) 


■^i3  —  —  fo.viX  (A11  4-  A22)  /2/x!(1  +  t'l)  (4.9) 

-h^\X  (£  -  £/3 )  //ii(l  +i/j) 

where  Af  is  the  average  biaxial  modulus  f\M\  +  /2Af2,  /i*  is  the  average  shear 
modulus  /ipi  +  f2fi2  and  where  X  =  MiM2/AT.  The  first  forms  in  eq.  4.6-4.9 
are  the  general  forms  and  the  second  are  the  particular  forms  for  7n / o2  i.e  when 
A11  =  £  ,  A12  =  A21  =  0  and  A22  =  (£'  -  2£/3). 

The  strains  in  layer  2  are  obtained  by  interchanging  the  subscripts  on  the  material 
parameters  /,  /i,  u,  M  and  multiplying  by  -1.  The  stresses  in  the  70  layer  of  70/0:2 
are: 


PiU  =  h 


A11  -f  A22  A11  -  A22 

x  — - - r  I  — - - 


/2[x(£'-£/3)+y£/3]  (4.10) 


P?  =  f 2 


A11  +  A22 

x - ? - Y 


P/2  =  p21  =  /2rA12 
A11  -  A22 


/2  X  (V  -  £/3)  -  y£/3 


(4.11) 

(4.12) 


P?' 3  =  0  (4.13) 

where  Y  =  2 ^i^2/^* 

Again,  the  first  form  is  the  general  form  and  the  second  form  is  the  particular  result 
for  7o/o2.  The  stresses  in  layer  2  are  P%0  =  -(h/f2)P?0  as  they  must  be  to  comply 
with  the  zero  volume  average  of  stress  components  rule.  In  the  7o/o2  approximation 
to  lamellar  Ti-Al  there  are  three  types  of  stresses  with  different  origins  (i)  in-plane 
shear  stresses,  caused  by  the  tetragonality  of  7,  (ii)  out-of-plane  shear  stresses  and 
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(iii)  biaxial  tension/compression  stresses  caused  by  the  fact  that  a2  has  larger  lattice 
parameters  than  7. 

The  in-plane  shear  stresses  have  the  same  origin  (the  tetragonality  of  7)  as  above 
and  consequently  the  same  order  of  magnitude.  But  this  stress  is  split  between  the 
phases  and  in  a  typical  Ti-Al  alloy  in  which  the  a 2  volume  fraction  is  0.1,  the  a2  layers 
have  a  stress  of  about  1.6  GPa  and  the  7  layers  a  stress  of  about  0.2  GPa.  However, 
this  conclusion  is  reached  because  the  g  layers  are  all  of  one  variant.  If  there  were  a 
mixture  of  variants,  the  in-plane  shear  stresses  would  be  lower  in  0:2  and  higher  in  7 
(see  below). 

The  out-of-plane  shear  stresses  have  maximum  values  of  Pn/2  and  P22/2.  The 
larger  of  these  is  Pn/2  in  both  layers  because  there  is  a  larger  misfit  between  and 
7o  in  the  xx  direction  than  there  is  in  the  x2  direction.  The  value  of  this  shear 
stress  in  a?  is  about  2.3  GPa  and  in  70  it  is  about  0.3  GPa,  The  reason  for  the 
disparity  is,  as  before,  that  the  minority  phase  strains  more  than  the  majority  phase 
(when  the  elastic  constants  are  comparable)  and  consequently  is  more  highly  stressed. 
The  biaxial  compression  in  the  a2  layers  (and  consequent  biaxial  tension  in  the  7 
layers)  deserves  a  detailed  discussion  because  it  could  have  a  decisive  effect  on  the 
mechanical  properties  of  Ti-Al  alloys.  The  fact  that  P2n  +P222  is  negative  in  a2  layers 
gives  this  material  a  substantial  protection  against  the  propagation  of  a  crack  across 
the  lamella  which  it  would  not  have  in  the  monolithic  form.  The  larger  the  lattice 
parameter  of  £*2  can  be  made  by  alloying,  the  greater  this  protection  will  become. 
The  biaxial  compressive  stress  in  a2  exerts  large  forces  on  dislocations  gliding  on 
the  pyramidal  (but  not  the  basal  or  prismatic)  slip  planes.  Since  dislocation  glide  is 
unaffected  by  hydrostatic  pressure,  it  is  easier  to  think  of  the  biaxial  compression  as 
an  effective  uniaxial  tension  along  the  basal  plane  normal:  the  average  compression  is 
(P| 1  4-  P22)/2.  If  a  pressure  of  this  magnitude  is  subtracted  from  each  of  the  normal 
stresses  in  a 2  the  effective  stresses  n  are 

ttIX  _  P211  ~  tt22  _  P22  —  Pj1  tt33  _  P“  +  P%2 

2  2  2 

n33  is  the  effective  tensile  stress  along  [001].  Its  magnitude  is  approximately  4.2/i 
GPa.  There  is  a  compensating  compressive  stress  of  —4.2/2  GPa  in  the  7  layers.  When 
/,  »  f 2  the  very  large  effective  tensile  stress  in  q2  must  make  it  much  more  easily 
deformable  in  tension  along  [0001]  than  in  compression  and  PST  specimens  deformed 
in  the  N  orientation  would  be  expected  to  show  a  substantial  tension/compression 
asymmetry  if  slip  on  the  pyramidal  planes  of  Q2  is  important  in  the  deformation. 

a2-  three  7  variant  alloys 

In  this  example  the  three  (or  six)  variants  of  7  are  equally  represented  with  total 
volume  fraction  3/,  the  remaining  (1  —  3/)  being  02  •  Layer  1  is  70  and  7is0)  layer  2 
is  760  and  7240,  layer  3  is  7120  and  7300  and  layer  4  is  o2- 

The  strains  are: 

Ki  =  K  +  l  l  ^  (4-14) 
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The  stresses  are 

a -X ?]+/*.«[T  —2/3  ]  (418) 


p2  =  X  (l  —  3/)  (e  —  e/3^ 

i  r  i  o 

)  o  1  +^l£ 

'  -1/3 

.  i/vS 

l/VS  ' 

i/3 

(4.19) 

P3  =  X(l~3f)(e'  -e/3) 

0  1  ] +/il£ 

-1/3 

-l/x/3 

-1/V3  ' 

1/3 

(4.20) 

Pi=-X(  1 

-3/)(e'-e/3)  | 

10' 
0  1 

(4.21) 

These  strains  and  stresses  have  some  of  the  features  of  both  the  previous  exam¬ 
ples.  The  Tvariants  may  be  thought  to  reach  coherency  in  two  stages:  First  they 
shear  by  the  same  amount  as  in  eq.  ??,  reaching  three-fold  symmetry  because  the 
volume  fractions  of  the  three  variants  are  equal;  second,  they  undergo  a  biaxial  stretch 
represented  by  K.  The  a2  layers  suffer  a  biaxial  compression  which  is  large  when  / 
is  close  to  1/3  i.e.  when  a2  is  a  minority  phase.  At  coherency,  the  lattice  parameters 
in  the  x3  plane  are  equal,  with  value  a\/2  =  (1  +  K\l)  : 

ac  =  aV§[l  +  3/Mi  |  +  (1  -  3 (4.22) 


which  is  the  weighted  mean  of  the  individual  lattice  parameters,  the  weighting  factor 
being  M.  Explicitly, 

0>2 

Qc  =  W[fMl  +  /Ml(1  +  £/2)  +  fM i (1  +  £/2)  +  (!  “  3/)M2(  1  +  e')\ 

which  is  the  same  as  4.22.  The  result  in  4.22  is  the  most  compact  form  of  the  elastic 
solution  to  the  entire  multilayer  since,  from  it,  the  strains  of  4.14-4.17  may  be  obtained 
directly  and  the  stresses  of  4.18-4.21  may  be  obtained  from  ??  by  Hookes  law. 
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4.1.  Discussion 

The  stresses  and  strains  in  a  fully  coherent  7/02  lamellar  Ti-Al  alloy  depend  on  the 
crystallographic  mismatches  e,  elastic  constants  and  volume  fractions  of  the  7  variants 
and  (*2.  Experimentally,  the  three  variants  of  7  are  usually  equally  common  and  the 
volume  fraction  of  o2  lies  between  0.1  and  0.5.  Both  e  and  e  depend  on  composition. 
The  7  phase  is  close  to  stoichiometry  [26]  ,  [?]  and  e  is  measured  to  be  close  to  0.02 
at  room  temperature  [8].  e'  is  less  accurately  known  since  the  composition  of  £*2  is 
far  from  stoichiometric  Ti3Al  .  In  our  estimates  we  have  assumed  it  to  be  0.021,  as 
given  by  one  set  of  measurements  [23] .  Changes  of  temperature  can  change  e  and  e 
because  expansion  coefficients  in  the  basal  plane  of  0-2  and  along  x  and  z  in  7  are  all 
different. 

Taking  e  to  be  0.02  and  e  to  be  0.021,  the  most  significant  stresses  are  in-plane 
shears  (proportional  to  e)  in  all  the  7  lamellae  of  about  0.9  GPa  and  a  biaxial  com¬ 
pression  (proportional  to  e  —  e  /3)  of  between  3.8  GPa  and  2.1  GPa  in  02  and  a 
compensating  biaxial  tension  in  7  of  between  0.4  GPa  and  2.1  GPa.  .The  biaxial 
stresses  vary  between  these  limits  as  the  volume  fraction  of  0:2  varies  from  0.1  to  0.5. 

These  estimates  of  the  internal  stresses  are  upper  limits  because  Ti-Al  has  lamel¬ 
lae  which  are  not  thin  enough  to  be  fully  coherent.  Much  of  the  stress  is  relaxed 
by  interfacial  dislocations  and  the  measured  stresses  are  one  to  two  orders  of  mag¬ 
nitude  smaller  than  the  coherent  estimates  above  [13],  [2].  To  our  knowledge,  there 
is  no  detailed  theory  to  determine  the  coherency  limit  in  a  multilayer,  especially  one 
containing  lamellae  with  a  variety  of  thicknesses.  But  the  Frank  and  van  der  Merwe 
theory  [9]  can  be  used  to  make  an  estimate  of  the  maximum  lamellar  thickness  Hc  in 
a  coherent  multilayer  and  also  the  factor  by  which  the  stresses  are  reduced  in  partially 
coherent  multilayers.  The  theory  shows  [13]  that  in  Ti-Al,  Hc  is  approximately  30nm 
and  in  lamellae  which  are  thicker,  the  stresses  are  reduced  by  a  factor  Hc/H.  In 
PST  material,  the  lamellae  are  typically  lpm  thick  and  the  stresses  are  consequently 
reduced  from  the  coherent  values  by  a  factor  of  about  30,  in  line  with  experiment.  In 
recently  developed  high  strength  alloys  the  lamellae  are  typically  100  nm  thick  and, 
in  them,  the  stresses  are  predicted  to  be  reduced  from  the  coherent  values  by  only  a 
factor  of  3.  The  coherency  stresses  in  high-strength  lamellar  Ti-Al  alloys  are  therefore 
certainly  sufficient  to  contribute  significantly  to  their  yield  stress  of  approximately  1 
GPa. 


4.2.  Conclusions 

1. General  formulae  are  given  for  the  stress  and  strain  tensors  in  any  layer  of  a  co¬ 
herent  multilayer  which  may  contain  any  number  of  different  layers,  with  different 
thicknesses  and  different  elastic  constants.  The  restrictions  in  the  calculation  are 
that  isotropic  elasticity  is  assumed,  only  in-plane  mismatches  are  used  and  free  sur¬ 
face  effects,  though  taken  into  account,  are  not  included  in  the  formulae. 

2.  At  coherency  all  the  lattice  parameters  match  so  each  layer  of  a  particular 
material  or  variant  has  undergone  the  same  strain  and  contains  the  same  stress.  In  an 
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ideal  Ti-Al  multilayer  there  are  only  four  coherency  stress  tensors  and  four  coherency 
strain  tensors  in  the  whole  multilayer. 

3.  In  a  bimaterial  multilayer,  each  stress  component  in  one  material  is  proportional 
to  the  volume  fraction  of  the  second  material.  In  Ti-Al  the  minor  phase  is  a2  and 
consequently  the  stresses  and  strains  are  higher  in  this  phase  than  in  7. 

4.  In  a  Ti-Al  multilayer  containing  only  g  variants,  the  in-plane  lattice  parameters 
are  the  averages  of  the  individual  7  lattice  parameters.  When  the  volume  fractions 
of  the  variants  are  equal,  the  coherent  multilayer  shows  three  fold  symmetry  on  the 
layer  plane. 

5.  The  in-plane  shear  stresses  in  Ti-Al,  caused  by  the  tetragonality  of  the  unit 
cell,  do  not  influence  the  soft  mode  of  deformation  but  do  exert  forces  on  hard  mode 
dislocations. 

6.  In  7/ a2  coherent  multilayers  the  lattice  parameter  is  the  weighted  mean  of  the 
individual  lattice  parameters,  the  weighting  factor  being  M,  the  biaxial  modulus. 

7.  In  7/a2  multilayers  large  biaxial  compressive  stresses  exist  in  the  a2  layers. 
These  stresses  tend  to  prevent  crack  propagation  through  »2  and  they  promote  the 
tensile  plastic  deformability  of  a2  normal  to  the  lamellar  plane. 

5.  STRESSES  IN  LAMINATED  ELASTIC  STRUCTURES  WITH 
MISFIT  (general  thermodynamic  study) 

5.1.  The  notion  and  geometry  of  misfit  deformation 

In  this  Section  we  give  a  simplified  linear  description  of  elastic  heterogeneous  systems 
with  coherent  interfaces  and  misfit  strains  (see  Dimiduk  et  al  [6],  [7],  Hazzledine  et 
al  [13],  Kad  et  al  [15]  for  the  crystallographic  description  and  physical  motivation, 
and  Grinfeld  [11]  for  the  more  general  situation  and  a  more  detailed  description  of 
the  mechanics  of  heterogeneous  solids  with  finite  misfit  deformations).  Let  us  choose 
one  of  the  unstressed  configurations  of  the  crystalline  substance  to  be  the  reference 
basis  for  the  lamellae  elastic  structure.  In  the  reference  configuration  we  choose  some 
affine  coordinate  system  x{;  the  Latin  indices  run  the  values  1,2,3;  the  summation 
convention  over  repeated  indexes  is  implied  in  what  follows.  We  remind  the  reader 
that  each  affine  coordinate  system  has  straight  coordinate  axes  which  are  not  nec¬ 
essarily  perpendicular  to  each  other  (in  particular,  any  Cartesian  coordinate  system 
is  necessarily  affine):  in  the  following  we  specify  the  choice  of  a  specific  coordinate 
system  which  is  the  most  convenient  for  the  structure  in  question. 

We  denote  by  n‘  the  components  of  a  unit  vector  which  is  perpendicular  to  the 
planes  along  which  the  lamellae  match  each  other.  The  total  displacement  within 

L-th  each  of  the  lamellae  U{  (xk)  can  be  splitted  into  two  parts 

L 

Ui  (xk)  =  Ai:i  xj+  m  (xk)  (5.1) 

£  L  L 
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where  the  first  term  Ay  xJ  is  associated  with  the  misfit  deformations  whereas  the 
L 

second  ty  (xk)  is  associated  with  the  additional  elastic  deformations.  We  assume 
L 

that  the  misfit  strain  Ay  itself  does  not  generate  any  stresses  in  the  sense  that  under 
L 

this  strain  the  isolated  lamellae  does  not  accumulate  any  elastic  energy.  The  misfit 
strain,  however,  is  the  indirect  source  of  huge  elastic  stresses  within  the  laminated 
structure  since  the  strains  Ay  differ  very  significantly  (as  compared  with  conventional 

L 

elastic  deformations)  within  different  lamellae.  That  difference  in  the  misfit  strain  for 
different  lamellae  triggers  paramount  elastic  strains  dj  ty  (xk)  and  stresses  Pybecause 

L  Li 

of  the  requirement  of  coherency  across  the  interfaces  between  different  lamellae.  In 
what  follows  the  coherency  is  implied  as  the  continuity  of  the  total  displacement 
across  the  interfaces.  Using  the  notation  (-A]£+1  of  the  jump  of  the  quantity  A  across 
the  interface  separating  the  lamellae  numbered  as  K  +  1  and  K  we  can  present  the 
coherency  condition  as  follows: 

Mk+1  =  +  ty(**)]£+1  =  0  (5.2) 

If  we  choose  the  coordinate  system  in  such  a  way  that  the  coordinate  planes  x1  — x2 
(i.e.,  the  plane  x3  =  const  )  coincide  with  the  interfaces  between  the  lamellae,  the 
equation  5.2  appears  to  be  equivalent  to  the  following  one: 

[Ay+  a,«i(xfc)]J+14  =  0  (5.3) 

Here  and  in  what  follows  the  Greek  indices  run  the  values  1,2  and  the  summation 
convention  over  repeated  Greek  indices  is  implied;  the  ’’shift”  symbol  6^  is  equal  to 
1  if  i  =  a  and  it  is  equal  to  zero  otherwise. 

The  equality  of  the  forces  across  the  coherent  interface  can  be  written  as 

[p<,]£+1n;  =  °  (5-4) 

The  assumption  of  the  linear  elasticity  gives  the  following  formula: 

pij=  cm  gt  Uk  (5.5) 

L  L  L  . 

where  the  elasticity  tensors  are  symmetric  with  respect  to  the  permutation  of  the  in¬ 
dices  c«w=  cjikl=  cklii=  ...  Combining  5.4,  5.5,  we  arrive  at  the  following  equations: 
L  L  L 

[ci*kldluk]*+1nj=0  (5.6) 

Within  the  bulk  of  each  lamellae  the  following  equilibrium  equation  should  be 
satisfied: 

Cijkl  dkdt  uk=  0  (5.7) 

£  L 
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The  system  of  equations  5.3,  5.6,  and  5.7  allows  to  find  equilibrium  displacement 
within  each  of  the  lamellae  Ui  (xk)  provided  the  appropriate  mechanical  conditions 

L 

at  the  external  interface  is  specified. 

The  equilibrium  displacements  depend  significantly  on  the  ’’misfit”  strains 

and  the  elasticity  tensors  clikl.  The  following  two  experimental  observations  allow  to 
simplify  the  system  of  equilibrium  equation  significantly: 

i) the  elasticities  c'-^are  the  same  within  each  of  the  lamellae;  in  the  following 

consideration  we  use  for  them  the  notation  ci,w, 

ii) only  the  in-plane  components  of  the  misfit  strain  A{j  deviate  from  zero; 

hence  L 

&U  =  Aap  6? 6?,  Aia  =  0  (5.8) 

L  L  Lt 

5.2.  Some  features  of  the  stresses  in  laminated  structures  with  misfit 

The  TiAl  alloys  often  have  a  laminated  structure  with  the  laminae  having  the  in¬ 
plane  dimensions  much  greater  as  compared  with  their  thickness.  Because  of  this  very 
feature  some  general  conclusions  regarding  the  equilibrium  stress-strain  fields  can  be 
made  in  advance  (these  conclusions  can  be  made  on  the  basis  of  both:  the  experiment 
and  relevant  asymptotic  analysis  of  the  elasticity  equations).  The  equilibrium  fields 
can  be  splitted  into  two  parts:  i)the  uniform  stress-strain  fields  on  certain  distance 
from  the  external  boundary  and  the  external  edges  comparable  with  the  thicknesses  of 
individual  lamellae  (in  the  following  we  name  it  the  ’’internal”  field),  and  ii)the  edge 
distortions  of  the  above  "internal”  field  which  appear  to  be  very  significant  in  the 
vicinity  of  the  external  interfaces  (in  the  following  we  name  it  the  ’’external”  field). 
The  conditions  at  the  external  interface  cannot  change  the  very  fact  of  the  uniform 
character  of  the  ’’internal”  field  though  they  can  change  the  specific  parameters  of 
the  internal”  field;  needless  to  say,  that  the  conditions  at  the  external  surface  play  a 
crucial  role  in  the  formation  of  the  ’’external”  stress-strain  field.  Both  -  the  ’’external” 
and  the  ’’internal”  fields  can  play  significant  roles  in  different  aspects  of  the  behavior 
of  the  lamellae-like  alloys. 

5.3.  Uniform  misfit  stresses  of  the  ’’internal”  field 

5.3.1.  General  system  of  equations  defining  strain-stress  field 

The  equilibrium  ’’internal”  uniform  displacements  within  the  plates  u{  (xk)  can  be 

written  as  follows:  L 

m  (xk)  =  mj  x?  (5.9) 

L  L 
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where  is  the  unknown  matrix  of  the  equilibrium  gradients.  Inserting  5.9  in  the  equa¬ 
tions  5.3  and  5.6  we  get,  respectively: 


(5.10) 


[cijklKkl]*+1  Tlj  =  0 


(5.11) 


The  system  of  algebraic  equations  5.10-5.11  includes  less  equations,  than  the 
amount  of  the  unknowns  k ij.  This  system  includes  only  the  demands  based  on  the 

L 

mechanical  equilibrium  and  coherency  at  the  interfaces.  To  make  the  system  self- 
sufficient  certain  additional  physical  ideas  should  be  implemented.  A  deeper  analysis 
shows  that  there  are  several  ’’natural”  extensions  of  the  system  5.10-5.11.  Here  we 
shall  discuss  only  two  extensions  which  are  of  interest  for  TiAl  multilayers.  One  of 
them  concerns  the  ’’misfit”  stresses  in  multilayers  grown  on  a  thick  substrate,  the  other 
-  the  ’’misfit”  stresses  in  the  multilayer  which  are  free  from  any  external  support.  In 
the  former  case  the  substrate  appears  to  be  unstressed  because  of  its  thickness:  we 
refer  the  misfit  stresses  in  this  case  as  the  ” on-substrate”  misfit  stresses.  We  coun¬ 
terpose  these  ” on-substrate”  misfit  stresses  to  the  ’’free”  misfit  stresses  related  to  the 
latter  case. 

The  case  of  the  ’’free”  misfit  stresses  is  more  difficult  for  the  calculations  and  it  will 
be  studied  first.  For  this  very  case  and  periodically  repeated  stocks  of  the  lamellae 
the  appropriate  asymptotic  analysis  shows  that  for  the  following  equation  should  be 
satisfied  [24]: 

N  N 


(5.12) 


where  H  is  the  thickness  the  A'-th  lamella  within  the  period. 

K 

The  6 (N  —  1)  equations  5.10,  3 (N  —  1)  equations  5.11,  and  9  5.12  form  a  closed 

system  of  equations  with  respect  to  9 N  unknowns  Kki-  By  using  the  equation  5.10 

K 

we  can  find  3 N  unknowns  corresponding  to  the  skew-symmetric  (rotational)  part  of 
the  gradients  Calculation  gives  the  following  formulae: 

K 


[KI*il]jc  —  +  2  [*(*«)] K  nk6\inj]  (5.13) 

The  equation  5.13  allows  to  calculate  the  equilibrium  rotations  k  within  each 

Hii 

individual  lamellae  provided  the  equilibrium  deformations  n  have  already  been 

L(ka) 

determined  from  some  other  equations  exhibited  below.  The  equation  5.13  shows  that 

i) the  equilibrium  rotation  within  the  laminae  are  actually  generated  by  the  rotations 
in  the  misfit  deformations  and  by  the  rotations  associated  with  the  deformations,  and 

ii) the  equilibrium  displacements  can  be  determined  to  within  an  arbitrary  rotation  of 
the  sample  as  the  whole. 
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For  the  6N  unknowns  corresponding  to  the  symmetric  (deformation)  part  of  the 
gradients  we  find  the  following  6 N  equations: 

K 


(5.14) 

E  ?/%>•> = 0 

k=ik  K 

(5.15) 

(5.16) 

The  equation  5.10  gives  us,  in  particular 

K(«i)  nj  =  0 

L 

(5.17) 

and,  hence,  the  formula  of  the  equilibrium  rotation  can  be  simplified  to  read 

hdri  =  [AMi]J+1^f 

(5.18) 

According  to  the  equation  5.18  the  equilibrium  rotation  is  generated  by  the  rota¬ 
tion  in  the  misfit  only,  and  not  by  the  equilibrium  deformations. 


5.4.  Misfit  stresses  in  the  ’’free”  7/7  plate-like  periodic  structure  with 
isotropic  moduli 

In  the  case  of  the  7/7  lamellar  structure  the  elastic  moduli  of  the  lamellae  occur  to 
be  equal,  and  the  master  system  5.15-5.16  can  be  simplified  significantly  to  read: 

°ijkl  [*(*')]  £+1  nj  =  0  (5.19) 

N 

E  %>Hki)  =  0  (5.20) 

K=  1  K  * 

Using  the  equations  5.14,  5.19,  5.20  we  find  the  following  formula  of  the  equilibrium 
stresses  P,jf  within  the  L-th  lamella  of  the  7/7  laminated  structure  for  the  case  of 
isotropic  lamellae: 

f  “jEf  (rV"4*'  +  26“‘e“  ~  (*m  -  4jo)  (5.21) 

where  /i  is  the  shear  modulus  of  the  substance.  Simple  analysis  shows  that  for  the 

case  under  consideration  the  in-plane  stresses  Pa/3only  ( a  and  (3  run  the  values  1  and 
.  L 

2)  can  assume  non-zero  values.  For  those  very  components  the  formula  5.21  can  be 
simplified  to  read 

r  =  **  +  26°16lX)  (A  W)  -  %l)  (5-22) 
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5.5.  Misfit  stresses  in  the  ’’free”  a/ 7  -  periodic  structure  (different  elastic 
moduli) 

5.5.1.  a/7  -  periodic  structure  (2-layers,  isotropic  elastic  lamellae) 

When  considering  the  case  of  different  elastic  moduli  we  use  the  general  system  5.14 
-  5.16.  First,  we  consider  an  analytical  solution  for  the  case  of  the  isotropic  elas¬ 
tic  moduli  and  a  periodic  cell  consisting  of  two  lamellae:  the  one  having  the  a-like 
structure  and  the  other  having  the  7-like  structure.  A  routine  calculation  gives  the 
following  result  for  the  stresses  within  the  lamellae: 


2  MM/f 
1  2  2 

M/i  +  Mii 
11  2  2 


A?7  -  A77  6Qf3  + 
2  1  / 


2  MM/i  . 

- 122  ( A(a/})  -  A(a/J)V5.23) 

M/f  +  M/i  V  2  1 

11  22 


2  WH  (  \  2  WH  . 

- R _ - (  AT,  -  AT,  )  6of3 - (a(q/3)  -  A<“0) 

^  Mii  +  Mif  \  o7  1  /  M/f  +  Mi/V  2  1  ) 

11  2  2  \  '  11  22 


where  the  constant  R  is  defined  as 


Mi/  (i+3c 

1  1  V  1 


C)C  +  VH  1  +  3(7  C 
1/2  22  \  2/1 


M/i  (l  +  3C^  (1+?)  +  (1  +  3  2”)  (1+</) 


r  =  M 

K  ~  1-2  1/ 


The  in-plane  deformations  within  the  lamellae  k  are  equal  to 

Af 


(5.24) 


(?  -  ?)  (t,-  -  +  with  K"  -  Alrw>5-25) 

*■  (?  -  ?)  -  T-)  ^  -  ifi™  (ar  -  ar ) 


where 


MM/iii 

1212 


M/f  (1+3?)(l+?)  +  MFT  ^1  +  3  C  )H) 

For  the  out-plane  deformation  components  k  (33)  we  get  the  formulae 

M 


(5.26) 


(5.27) 


k<33>  = 

1 


(1  +  3C')  CM/i 

_ \ _ 2/122 _ 

MJT  ^1  +  3  C^j  ^1+  cj  +  l*H  ^1  +  3  c'j  ^1+  C^j 


a;  -  at7 
2  1 
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k<33>  = 
2 


(i  +  3  c)  cm 


?f(i+3f)(i+?)+?f  (i+3?)(i+?) 


A2,  -  to, 

\  2  1  , 


All  other  components  of  the  deformation  appear  to  be  zero. 

If  the  Poisson  ratios  of  the  phases  are  equal  -  i/  =  v  =  v  -  the  formulae  5.24  -  5.27 

allow  some  simplifications;  in  particular,  we  get  the  following  formula  of  R  in  this 
case: 


RP  = 


l-i/ 


(5.28) 


If,  in  addition,  the  shear  moduli  are  equal  -  U  -  the  formulae  5.23  of  the 

ii 

stresses  Va"  reads: 

M 


2  pH 

paP  _  _ 

i  H  +  H 

1  2 

2  pH 


(5.29) 


,  /  \  2  pH 

- -  A7  -  A7  8°#  +  - -  A<“^ 

V  2  1/  # +  f  V  2  1  / 

2^H  i,  /  \  2/itf 

P®'3  = - —  - -  I  A7  -  A7  <5“P - 3—  f  A(“P>  -  A<“P>') 

2  tf  +  tfl-i/^27  p)  H  +  h\  2  Ai  J 

The  formulae  5.29  appear  to  be  in  agreement  with  the  formulae  5.23  as  they 
should. 


5.5.2.  a/7  -  periodic  structure  (N- layers,  anisotropic  elastic  lamellae) 

The  analysis  of  this  most  general  case  is  based  on  the  system  5.14  -  5.16.  It  is 
convenient  to  introduce  the  following  matrices: 


L  k=n  ap  k=n  .  . 

T%Z=  E  Vf016?  ’  R  =  E  Hia^6  (A  -  A  )  (5.30) 

kZ 1  kK  K^i  KK  V«'(7«)  l  hi)) 


where  s  are  the  ’’partial  1-2”  part  of  the  compliance  tensor  s  ;  caPisis  the 
.  .  Lijki  L 

inverse  of  s  ,  i.e.,  the  tensor  satisfying  the  relationship 

L'yfrjVUJ 


rwu»=^4">  (531> 

In  terms  of  these  ’’material”  tensors  the  stress  tensor  within  the  L-th  lamella  Pal3 
can  be  calculated  with  the  of  the  following  linear  system  of  equations: 

l  L<*p 

Tg  R  (5.32) 

Li 
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We  remind  the  reader  that  in  the  equations  5.30  -  5.32  the  indices  run  the  values 
1  and  2  only  (not  3!). 

The  above  program  can  be  fulfilled  explicitly  in  the  case  of  isotropic  lamellae. 
Corresponding  calculations  give  the  following  formula  of  the  stresses  within  the  L-th 
lamella  of  the  spatial  period  including  N  lamellae: 


p<*P 

L 


2  v  PT  —  (l 

-  v)  FF 

MML  V 

M  /  LL  faff  £ 

+  f 

/  \ 

/  \  0  Ofl-of 

(1-  v  )2T 

(T  +  F) 

L 

_\  Ml  L 

\l  lJ 

• 

(5.33) 


where  T  and  F  are  defined  as  follows: 
L  L 


T  = 

L 


,  N  F 

jEf  f. 

z  K= iK  7 


(5.34) 


5.6.  Misfit  stresses  in  the  ”  on-substrate”  periodic  multilayers  with  differ¬ 
ent  elastic  moduli 


The  case  of  the  ”  on-substrate”  multilayer  is  of  the  primary  interest  in  many  appli¬ 
cations,  especially,  those  related  to  the  deposition  of  thin  films,  though  they  are  of 
a  secondary  interest  in  concerns  the  TiAl  lamellar  structure,  and  we  pay  much  less 
attention  to  it  in  this  report.  In  the  ” on-substrate”  case  the  extremely  thick  substrate 
does  not  deform  practically  and  it  can  be  treated  as  if  it  is  rigid.  Because  of  this  fact 
all  the  monolayers  of  the  multilayer  are  ’’aware”  of  presence  of  the  substrate  only, 
and  they  ’’know”  nothing  about  the  existence  of  other  monolayers.  Therefore,  we  can 


rewrite  the  system  5.14,  5.15 


K  —  ~  A(cr/?) 

M(a0)  Mv 


(5.35) 


cijklK  n,  =  —  c  *jfc<  A  n,-  (5.36) 

M  M{kl)  3  M  M(kl) 

where  if  the  ’’misfit”  with  respect  to  the  lattice  of  the  substrate  A  •  The  system 

M(kl) 

5.35,  5.36  appears  to  be  closed  with  respect  to  the  unknown  misfit  deformations  k 

M(kl) 

It  shows,  in  particular,  that  the  stresses  do  not  depend  of  the  lamellae’s  thicknesses. 
In  the  case  of  isotropic  lamellae  we  arrive  at  the  following  misfit  deformations  and 
stresses: 


k  =  0, 
M(a  3) 


K 

Af(33) 


VM 

1  —  I'M 


AQa 

M 


(5.37) 
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M(^)~  “2/iM  ^  + 


(5.38) 


6.  ROLE  OF  VACANCIES  IN  SLOW  EVOLUTION  OF  IN¬ 
TERFACES 

6.1.  Introduction 

The  elastic  misfit  stresses  in  TiAl  alloys  assume  huge  level  of  the  order  of  the  product 
of  the  elastic  moduli  times  misfit  deformations.  The  elastic  energy  associated  with 
these  stresses  is  enormously  large  also.  Hence,  there  appears  a  driving  force  for  the 
rearrangement  of  the  species  leading  to  the  evolution  accompanied  by  the  creation 
of  dislocations,  migration  of  vacancies,  development  of  interfacial  corrugations,  etc... 
The  goal  of  this  Section  is  to  develop  a  thermodynamic  approach  allowing  one  to  study 
the  possibility  of  stress  relaxation  by  means  of  mass  rearrangement  in  the  vicinity  of 
the  interface  between  the  lamellae.  We  assume  that  the  rearrangement  is  accompanied 
by  interfacial  diffusion  and  migration  of  the  vacancies.  Since  the  full  consideration  of 
the  coherent  interface  solid-solid  includes  lots  of  calculation  and  details  we  illustrate 
our  results  and  approach  by  considering  the  much  simpler  but  conceptually  a  very 
close  problem  of  mass  rearrangement  in  the  vicinity  of  a  stress-free  interface. 

Mass  transport  through  the  bulk  of  solids  is  produced  by  the  migration  of  va¬ 
cancies  whereas  near  surfaces  the  migration  of  vacancies  is  accompanied  by  surface 
diffusion,  vaporization-sublimation,  etc.  (see,  for  instance,  Christian  [5]).  The  sim¬ 
plest  and  most  robust  features  of  the  mechanism  of  surface  diffusion  has  been  studied 
by  Mullins  [20],  It  is  widely  accepted  that  this  mechanism  dominates  over  migration 
of  vacancies  near  traction-free  surfaces.  The  situation  can  change  dramatically  at  the 
nano-scale  (see,  for  instance,  Lee  and  Bennet  [19]).  At  this  level  the  very  applica¬ 
bility  of  common  thermodynamics  seems  questionable.  Since  at  the  nano-scale  it  is 
difficult  to  distinguish  between  the  bulk  and  the  surface  part  of  the  solid  it  seems  nat¬ 
ural,  at  least,  to  withdraw  existing  macro-  and  micro-scale  ’’obvious”  pre-assumptions 
and  to  explore  the  mechanism  of  interaction  between  surface  diffusion  and  vacancy 
migration. 

6.2.  The  master  system  of  slow  evolution 

In  this  note  three  dissipation  mechanisms  are  taken  into  account:  i)dissipation  from 
the  flow  of  vacancies  at  the  surface,  ii)dissipation  from  bulk  diffusion  of  vacancies, 
iii)dissipation  from  surface  diffusion.  In  addition  to  the  bulk  energy  <p  the  Herring¬ 
like  surface  energy  [14]  is  taken  into  account.  We  consider  a  slightly  corrugated  solid 
film  of  thickness  H  attached  to  a  rigid  substrate  (see,  Fig.  3).  It  is  assumed  that 
the  vacancies  can  penetrate  the  bulk  through  the  free  interface  E  of  the  film  only. 
We  denote  the  time  by  t,  the  spatial  coordinates  by  xk  (the  spatial  -  Latin  -  indices 
run  1,2,3,  and  summation  over  repeated  indices  is  implied).  %(x*,t)  is  the  volume 
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concentration  of  vacancies  and  Q'(xk,  t)  is  the  vector  of  the  volume  flux  of  vacancies. 
We  denote  the  surface  coordinates  of  the  free  surfaces  by  (the  surface  -  Greek  - 
indices  run  1,2);  c(f^,t)  and  g“(f0,t)  are  the  normal  velocity  of  the  free  surface  and 
the  surface  flux  of  the  species,  respectively.  Simple  kinematics  (Mullins,  1957  [20]; 
Christian,  1975  [5];  Grinfeld,  1994  [12];  Berdichevsky  et  al.  1996  [4])  leads  to  the 
following  equations  of  mass  conservation  within  the  bulk 


dx  _  dQ* 

dt  dx' 


(6.1) 


and  at  the  free  surface 


c(l  ~x)  =  Qini  +  ^  (6.2) 

where  nj(£^,  t)  is  a  unit  normal  to  the  surface.  The  first  term  on  the  right  hand  side  of 
Eq.  6.2  describes  the  migration  of  the  free  surface  from  the  flow  of  vacancies  through 
the  surface,  while  the  second  describes  the  migration  of  the  surface  from  the  surface 
diffusion. 

In  this  ’’kinematic  surrogate”  of  a  complete  thermodynamic  theory  we  ignore  the 
role  of  stresses  and  heat  and  suppose  that  the  energy  per  unit  volume  <j>  is  a  function 
of  the  vacancy  concentration  x>  whereas  the  surface  energy  per  unit  area  is  a  positive 
constant  a.  The  simplest  ’’constitutive”  equations  of  the  volume  and  surface  fluxes 
compatible  with  the  requirement  of  energy  dissipation  read: 


b  . 


d  d<j> 
dxi  dx 


(6.3) 


D*p 


d  <f>  —  <TK 
W  1-X 


(6.4) 


where  D'£  and  Da^  are  the  bulk  and  surface  positive  symmetric  diffusion  tensors, 
respectively;  k  is  a  doubled  mean  curvature  of  the  surface.  The  simplest  equation 
to  describe  the  flow  of  vacancies  Q'rii  through  the  free  surface  compatible  with  the 
principle  of  energy  dissipation  reads 


Q'rii  —  —Di 


<ft+(l  -x)<j>x-aK 

i-x  • 


(6.5) 


where  £)/  is  a  positive  constant  characterizing  the  resistance  of  the  solid  substance  to 
penetrating  of  vacancies;  <px  =  d<p/dx-  For  the  sake  of  simplicity  we  assume  that  no 
vacancies  flow  through  the  film-substrate  interface  E: 


Qini  =  0  (6.6) 

The  equations  6.1  -  6.6  form  a  closed  self-consistent  nonlinear  system  of  the  differ¬ 
ential  equations  and  boundary  conditions  for  describing  slow  evolution  of  deformable 
elastic  solids  with  vacancies. 
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We  denote  by  x°  the  equilibrium  concentration  of  vacancies  within  the  film  when 
it  has  a  plane  free  surface  and  is  exposed  to  vacuum.  Hence,  according  to  Eq.  6.5, 
X°  satisfies  the  equation 

0(xo)  +  (l-X°)Mx°)  =  O  (6.7) 


6.3.  Rate  of  change  of  small  disturbances 


Let  us  assume  now  that  the  free  surface  E  is  slightly  corrugated.  The  evolution  of 
this  corrugation  can  be  explored  by  linearizing  the  equations  6.1  -  6.7  in  the  vicinity 
of  the  uniform  configuration  with  further  consideration  of  the  solutions  of  the  form 
~  exp {r]t  +  ika£a);  here  are  the  Cartesian  coordinates  on  the  equilibrium  plane 
surface,  ka  is  the  in-plane  wave-number  of  the  corrugation,  r)  is  the  growth-decay  rate 
of  the  corrugations. 

In  the  following  we  assume  that  the  film  is  isotropic  and  contains  a  small  concentra¬ 
tion  of  vacancies  x°  "C  1.  Hence,  the  diffusion  tensors  take  on  the  form:  =  Df,6 *J, 

D013  =  Ds6aP ,  where  Dt,  and  Ds  are  positive  constants  whereas  <5,J  and  6nf}  are  the 
spatial  and  surface  Kronecker  symbols.  A  routine  computation  results  in  the  following 
dispersion  equation  for  this  case: 


(r?  +  Dbo  fc4)  +  DbDjcrk3  =  0  (6.8) 

where  N  =  d^4>{x°)/9x2- 

Eq.  6.8  shows  that  the  real  77  are  necessarily  negative  as  they  should  be  in  the 
absence  of  destabilizing  factors. 

If  i}/(k2N Db)  «  1  we  can  derive  from  Eq.  6.8  the  following  dispersion  relation: 


Dtk  tanh  (*"^+*3^)  +  D, 


—  at 3  tanh  (fcfiQ  +  DjDak  +  DbDi 

^  Dbk  tanh  (kH)  -f  Dj 


(6.9) 


Eq.  6.9  shows  clearly  that  the  three  dissipation  mechanisms  are  not  additive  but 
interact  with  each  other.  It  is  quite  instructive  to  extract  from  Eq.  6.9  Mullins’ 
fc4-term  by  rewriting  the  equation  in  the  form 


T]  =  — D3ak 4 


^ftanh  (kH).+  %± 


Db 


(6.10) 


The  second  terms  in  the  brackets  gives  a  correction  to  the  ”fc4-law”  which  might 
even  dominate,  depending  on  the  magnitudes  of  the  nondimensional  numbers  Dek2/ Dj , 
DBk/Db,  kH.  First  of  all  the  assumption  T]/(k2NDb)  <  1  gives  for  the  Mullins  mode 
the  following  short  wave-length  limitation:  D8ak2/NDb  <C  1.  The  correction  term 
approaches  zero  at  Dsfc2/£>/  »  1  or  Dsk/Db  »  1.  For  thin  films  the  asymptote 
kH  — *  0  is  often  of  interest.  In  this  limit  Eq.  6.10  reads 


rj  =  - Daak 4 


(6.11) 
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and  for  sufficiently  long  wave  corrugations  with  k  1  >  Da/Db  the  correction  term  is 
comparable  with  the  Mullins  &4-term.  The  fc3-term  appears  to  dominate  as  k  — ►  0 
and  in  this  limit  the  rate  7/  approaches  -Dbak3.  The  k3  variation  also  holds  for 
thick  films  in  the  case  of  ’’instantaneous  kinetics”  (very  large  values  of  Dj).  We 
notice  that  the  same  dispersion  equation  77  oc  fc3  is  valid  for  the  transport  mechanism 
’’vaporization-condensation”  which  is  presumably  more  effective  than  the  vacancy 
transport  at  macro-  and  micro-scale  level  but  the  situation  could  be  opposite  at  the 
nano-scale  level  because  the  values  of  De,  Db,  and  £>/  need  not  be  the  same  as  they 
are  in  macroscopic  bodies. 

It  is  well-known  that  stresses  within  the  solid  can  destabilize  flat  interfaces  when 
rearrangement  of  mass  is  allowed.  Assuming  that  the  film  is  isotropic  and  it  is  suffi¬ 
ciently  thick  we  arrive  the  following  generalization  of  Eq.  6.8: 


{d^ 

[/ 


V 


k2NDb 


0" 


+  Di  7?  + 


(6.12) 


V 


-( DkDi  +  DbDtk2)k2  +  DiDsk 


rp2 

ak  —  (1  —  v) — 
/* 


=  0 


k?NDb 

where  v  is  the  Poisson  ratio,  T  is  the  uniaxial  stress.  Eq.  6.12  can  be  presented  in 
the  following  dimensionless  form: 

(VT+R  +  Kib)R  +  [x/mtATi*  +  Ksb)  +  KibKab]  [1  -  (1  -  v)*a]  T  -  0  (6.13) 
where  the  dimensionless  numbers  R,  Kib,  Ksb,  4>a,  T  are  defined  as  follows: 

V 


N 


_  .,  „  Di  Dsk  _  T2  ~_°k 

R  ~  k2NDb ’  ib  ~  Dbk ’  Kab  ~  Db  ’  T 

At  jR  •C  1  we  get  the  following  nondimensional  dispersion  equation: 
Kib  +  Kab  +  KibKBb 


R  = 


1  +  Kib 


[1  —  (1  —  i/)$a]T 


(6.14) 

(6.15) 


which  gives  a  consistent  generalization  of  the  dispersion  equation  of  Asaro  and  Tiller 
(1972)  [3],  compatible  also  with  the  static  stability  criteria  (see,  for  instance,  Grinfeld, 
1991  [11];  Nozieres,  1991  [21])  though  it  differs  from  the  criterion  of  Vasudev,  Asaro 
and  Tiller  (1975)  [25]exploring  the  same  mechanism  of  rearrangement  via  the  bulk 
diffusion  in  pre-stressed  crystal.  The  possibility  of  stress  driven  ’’necking” of  a  rod 
caused  by  the  migration  of  vacancies  has  been  noticed  by  Larche  and  Cahn  (1985) 

[17]. 


7.  CONCLUSION 

7.1.  The  main  results  of  this  project. 

The  elastic  misfit  stresses  in  TiAl  alloys  assume  huge  level  of  the  order  of  the  product 
of  the  elastic  moduli  times  misfit  deformations.  The  elastic  energy  associated  with 
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these  stresses  is  enormously  large  also.  Hence,  there  appears  a  driving  force  for 
the  rearrangement  of  the  material  elements  leading  to  the  evolution  accompanied 
by  the  creation  of  dislocations,  migration  of  vacancies,  development  of  interfacial 
corrugations,  etc... 

The  goal  of  this  research  project  was  two- fold:  i)to  develop  a  theoretical  approach 
which  permits  a  self-consistence  calculations  of  misfit  stresses  in  TiAl  lamellar  multi¬ 
layers  and  like  structures,  and  ii)to  develop  a  thermodynamic  approach  allowing  one 
to  study  the  possibility  of  stress  relaxation  by  means  of  mass  rearrangement  in  the 
vicinity  of  the  interface  between  the  lamellae. 

We  have  developed  a  general  scheme  and  establish  several  simple  general  formulae 
permitting  to  evaluate  numerically  and  to  analyze  qualitatively  misfit  stresses  in  TiAl 
multilayers.  On  the  basis  of  the  model  of  coherent  interfaces  we  arrive  to  the  conclu¬ 
sion  that  one  have  to  carefully  distinguish  between  the  ”on-substrate”  misfit  stresses, 
on  the  one  hand,  and  the  ”free-edge”  misfit  stresses,  on  the  other  case.  The  former 
case  is  more  relevant  for  the  stresses  appearing  at  different  processes  of  deposition: 
the  misfit  deformations  and  strains  in  the  ”  on-substrate”  multilayers  can  be  calcu¬ 
lated  with  the  help  of  the  master  system  5.35,  5.36  (for  the  general  anisotropic  case) 
and  simple  formulae  5.37,  5.38  for  the  case  of  isotropic  monolayers.  The  ”free-edge” 
misfit  deformations  and  stresses  can  be  calculated  with  the  help  of  the  master  system 
of  algebraic  equations  5.14  -  5.16  for  the  arbitrary  anisotropic  lamellae  with  different 
moduli.  Much  more  simple  working  formulae  can  be  used  in  the  case  of  isotropic 
lamellae: 

i  -  5.21,  5.22  for  the  case  of  equal  elastic  moduli  (N-layer  periodic  multilayer  in  width), 

ii  -  5.23-5.29  for  the  case  of  different  elastic  moduli  (2-layer  period  multilayer  in 

width), 

iii  -  5.33,  5.34  for  the  case  of  different  elastic  moduli  (N-layer  periodic  multilayer  in 

width). 

When  studying  possible  mechanisms  of  stress  relaxation  we  assumed  that  the  re¬ 
arrangement  is  accompanied  by  interfacial  diffusion  and  migration  of  the  vacancies. 
Since  the  full  consideration  of  the  coherent  interface  solid-solid  includes  lots  of  cal¬ 
culation  and  details  we  illustrate  our  results  and  approach  by  considering  the  much 
simpler  but  conceptually  a  very  close  problem  of  mass  rearrangement  in  the  vicinity 
of  a  stress-free  interface.  Using  one  of  the  approaches  of  irreversible  thermodynamics 
of  solids  we  explored  the  slow  evolution  of  corrugations  at  the  free  boundary  of  a  crys¬ 
talline  substance.  The  central  issues  of  this  study  are  the  roles  and  the  interaction 
of  the  three  dissipation  mechanisms:  i)the  surface  diffusion  of  the  species,  ii)the  bulk 
diffusion  of  vacancies,  and  iii)the  flow  of  vacancies  within  the  solid  from  vacuum.  For 
the  flux  of  vacancies  from  vacuum  we  have  postulated  the  constitutive  equation  6.5 
which  can  be  easily  justified  on  the  basis  of  Onsager  ideology  (this  means,  basically, 
that  the  constitutive  law  for  the  flux  should  obey  the  linearity  between  the  flux  and 
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corresponding  driving  force,  and  it  has  to  provide  the  decay  of  the  total  accumu¬ 
lated  energy).  The  new  dissipative  material  constant  £)/  appearing  in  this  modelling 
should  be  determined  by  quantitative  measurement  of  different  effects  relating  to  the 
penetration  of  vacancies  within  the  solid  from  vacuum  (which  is  nothing  else  but  re¬ 
arrangement  of  the  species  accompanied  by  producing  new  vacancies  in  the  vicinity 
of  the  free  surface).  One  of  the  effects  of  this  sort  is  the  evolution  of  surface  corruga¬ 
tions:  the  growth-decay  rate  of  the  corrugations  depends,  besides  other  parameters, 
on  the  value  of  D/.  We  have  derived  a  rather  simple  dispersion  equation  6.8  of  the 
growth  rate  of  the  surface  corrugation  rj  with  the  help  of  the  master  system  of  slow 
evolution  (Eqs.  6.1  -  6.6)  using  the  simplest  model  of  the  isotropic  elastic  solid  with 
vacancies  (Lifehitz,  1962  [18]).  The  equation  6.8  and  its  implications  6.9  -  6.11  allow 
one  to  find  those  corrugations  for  which  the  Mullins’  k4  -  law  gives  acceptable  approx¬ 
imation  and  those  for  which  it  does  not.  The  equations  6.8  -  6.10  clearly  show  that 
the  three  above  mentioned  dissipative  mechanism  cannot  be  described  as  the  linear 
superposition  even  for  small  disturbances  (as  they  are  treated  usually  in  the  existing 
literature). 

As  the  side  result  of  this  study  we  have  derived  the  dispersion  equation  6.12  of 
the  growth  rate  of  surface  corrugations  in  the  pre-stressed  crystal.  Our  dispersion 
equation  appears  to  be  consistent  with  the  known  thermodynamic  criterion  of  the 
stress  driven  rearrangement  instability  as  it  should. 

7.2.  The  most  promising  directions  of  future  research 

The  result  of  this  project  demands  further  studies  in  what  concerns  both  theory  and 
experiment.  The  following  directions  of  theoretical  studies  look  the  most  urgent  and 
promising: 

I  Misfit  stresses  in  the  case  of  large  misfit  deformations  (actually,  only  the  ’’in¬ 
plane”  misfit  deformations  can  be  supposed  small  geometrically  -  of  the  order 
of  several  %  -  in  the  first  approximation;  the  shear  deformations  are  actually 
extremely  big  of  the  order  of  hundred  %%). 

II  Misfit  stresses  in  the  plate-like  random  structure 

III  The  edge  effects 

•  The  edge  effects  in  vicinity  of  free  surface 

•  The  edge  effects  in  vicinity  of  intersection  of  two  free  surfaces 

•  The  edge  effects  at  the  corners 

IV  The  grain  structure  of  the  lamellae 

•  An  isolated  grain  within  a  lamella 

•  An  equilibrium  shape  of  an  isolated  grain  within  a  lamella 
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•  Stability  of  an  isolated  grain  within  a  lamella 

•  A  periodic  ensemble  of  the  grains  within  the  isolated  lamella 

V  Effective  elastic  moduli  of  laminated  elastic  structures  with  large  internal  stresses 

•  Effective  elastic  moduli  in  the  7/7  -  periodic  structure  (equal  elastic  moduli) 

•  Effective  elastic  moduli  in  the  a/7  -  periodic  structure  (different  elastic  moduli) 
VI  The  non-elastic  effects 

•  Relaxation  of  misfit  stresses  via  the  mechanism  of  the  stress  driven  rearrange¬ 
ment  instability  at  the  coherent  interfaces 

•  Relaxation  of  misfit  stresses  via  proliferation  of  misfit  dislocations 

•  Nucleation  of  misfit  dislocations  via  the  mechanism  of  the  stress  driven  rear¬ 
rangement  instability 
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INTERFACIAL  SLIDING  IN  BRITTLE  FIBROUS  COMPOSITES 


Michael  C.  Larson,  Assistant  Professor 
Department  of  Mechanical  Engineering 
Tulane  University 

Abstract 

The  extent  of  interfacial  sliding  is  pivotal  to  the  mechanical  behavior  of  ceramic  matrix  composites. 
This  research  illuminates  the  relative  importance  of  interfacial  friction,  interfacial  toughness,  and  interfacial 
roughness  in  determining  the  sliding  resulting  from  an  impinging  matrix  crack.  A  computational  model 
based  on  a  two  dimensional  surface  integral  formulation  was  used  to  model  interfacial  behavior.  During  the 
project  period,  a  novel  experiment  was  developed  for  cracks  at  frictional,  rough  interfaces  which  shows  the 
three-dimensional  nature  of  the  crack-interface  interaction.  The  experiments,  coupled  with  the  modeling, 
can  provide  guidance  for  tailoring  the  crack  resistance  of  composites  in  structural  applications. 
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INTERFACIAL  SLIDING  IN  BRITTLE  FIBROUS  COMPOSITES 

Michael  C.  Larson 


1.0  INTRODUCTION 

Research  into  ceramic  matrix  composites  has  sprung  from  the  desire  for  strong,  tough  materials  at 
extremely  high  temperatures  (1000-1500  degrees  Celsius).  Monolithic  ceramics  are  inherently  brittle. 
Oxide-oxide  material  systems  are  promising  for  introducing  a  measure  of  crack  resistance  while  retaining 
desirable  material  properties.  In  these  materials,  the  fiber/matrix  interface  plays  the  key  role  in  controlling 
the  global  toughness  of  the  composite.  The  degree  of  interfacial  toughness,  roughness,  and  friction  will 
decide  the  length  of  the  interfacial  sliding  zones.  The  size  of  these  sliding  zones  will  provide  a  measure  of 
pseudo-ductility  by  controlling  the  matrix  crack  spacing,  and  thus,  the  global  toughness  of  the  material. 

The  objective  of  this  work  is  to  link  a  mechanics-based  computational  modeling  technique  with  a 
materials-based  experimental  program  aimed  at  identifying  and  optimizing  toughening  mechanisms  in 
ceramic  composites.  The  authors  have  been  engaged  in  applying  the  surface  integral  method  to  model 
fractures  at  interfaces  between  brittle  constituents;  researchers  at  the  Wright  Materials  Laboratory  have 
shown  through  their  experiments  that  interfaces  play  the  key  role  in  determining  the  crack  resistance  in 
technologically  important  brittle  matrix  composites.  By  coupling  these  two,  we  are  seeking  to  refine  our 
mechanistic  representation  of  the  interfaces  in  order  to  predict  composite  performance  and  to  design  optimal 
interfaces. 

2.0  BACKGROUND  AND  MOTIVATION 

In  brittle  composites,  the  length  of  the  interfacial  sliding  zone  at  the  periphery  of  a  matrix  fracture  plays 
an  essential  role  in  determining  the  global  toughness  of  the  composite.  This  is  manifest  in  two  ways,  the 
sliding  relieves  the  stress  concentration  which  is  transferred  to  the  fibers  and  hence  regulates  fiber  failure, 
and  the  sliding  influences  matrix  fracture  spacing  which  in  turn  dictates  energy  dissipation. 
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Previous  research  by  the  author  employed  the  double  cleavage  drilled  compression  specimen  (DCDC) 
introduced  by  Janssen  to  observe  the  behavior  of  a  crack  as  it  proceeds  toward  and  past  a  frictional  fiber/ 
matrix  interface.  As  the  DCDC  specimen  is  compressed,  a  stable  fracture  sweeps  down  its  mid-plane 
(Warren,  1987).  The  specimens  were  modified  to  include  fibers  oriented  perpendicular  to  the  plane  of 
fracture.  Due  to  the  brittle  nature  of  glass,  the  assumption  of  local  crack  extension  at  a  constant  fracture 
toughness  is  valid  and  any  variation  of  the  crack  front  is  assumed  to  be  due  to  the  inserted  fibers.  The  study 
revealed  the  optimal  levels  of  friction  for  friction-only  interfaces  so  that  the  material’s  resistance  to  fracture 
is  maximized.  Crack  profiles  are  displayed  in  Figure  1  for  three  cases  of  a  specimen  as  is,  one  with  holes  at 
1.905  cm  away,  and  one  with  fibers  at  1.905  cm  away.  This  shows  the  significant  crack  retardation  that  can 
be  achieved  from  the  insertion  of  fibers. 


Figure  1.  Crack  fronts  taken  at  constant  time  intervals  for  DCDC  specimens  with  a)  no  hole  b) 
hole  and  no  fiber  and  c)  fiber 


That  work  suggested  that  the  influence  of  the  interface  in  controlling  the  extent  of  the  interfacial  sliding 
could  be  characterized  by  the  local  angle  between  the  three-dimensional  crack  front  and  the  surface  of  the 
fiber.  This  work  pursued  these  two  notions,  computationally  determining  the  combinations  of  interfacial 
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properties  that  will  yield  desired  interfacial  sliding  lengths  and  experimentally  measuring  the  crack  angle  as 
a  function  of  the  friction,  toughness,  and  roughness  of  the  interface. 

3-ft  SOLUTION  methodology 

3.1  Computational  Method 

A  two-dimensional  surface  integral  method,  similar  to  the  displacement  discontinuity  method  described 
by  Crouch  (1976),  is  used  to  investigate  the  behavior  of  a  crack  intersecting  a  planar  interface  possessing 
toughness,  roughness,  and  friction.  The  necessary  singular  fundamental  solutions  are  derived  from  the 
elasticity  solution  of  a  point  force  in  an  infinite  medium  (Wiles  and  Curran,  1982). 

The  total  relative  displacement  at  any  point  of  the  cracked  surfaces  is  obtained  by  summing  the  effects 
of  a  continuous  distribution  of  the  fundamental  solutions  over  the  entire  cracked  domain.  This  procedure 
leads  to  the  governing  singular  integral  equation 

<V*)  =  irij^’UWdA  -  (EQ1) 

35 

where  Ojj  are  the  stresses  at  a  field  point,  x  is  the  location  of  the  field  point,  T ijk  are  the  kernel  solutions,  8k 
are  the  crack  displacement,  and  £  is  a  vector  which  spans  the  fracture  surfaces  (all  referenced  to  a  fixed 
Cartesian  frame). 

The  approximate  numerical  solution  to  the  governing  integral  equation  is  obtained  by  transforming  it  to 
a  system  of  algebraic  equations  with  the  nodal  displacements  as  unknown  quantities.  The  opening  and 
sliding  displacements  are  approximated  over  discrete  elements  of  the  fracture  surface  by  interpolation 
functions  which  depend  upon  values  at  specified  nodal  points.  The  crack  domain  is  divided  into  E  discrete 

elements,  i.e., 


a,/*)  =  IJr  t)5 [e)(t)dA 


(EQ  2) 
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Crack  displacements,  8,  are  calculated  by  boundary  collocation.  Interpolation  functions  relate  an 
assumed  displacement  variation  to  the  nodal  values: 


5<e)(£)  =  X  h(i)(n)b[n)  .  (EQ3) 

n  =  1 

Constant  displacement  interpolations  are  used  for  the  majority  of  the  elements  which  compose  the  fractures 
in  order  to  reduce  the  computational  time  needed  per  simulation.  Elements  with  specialized  high  order 
interpolations  are  necessary,  however,  for  near-tip  regions  having  large  displacement  gradients.  This  higher 
order  interpolation  is  based  upon  the  first  two  terms  of  the  asymptotic  elastic  crack  displacement  field 
governed  by  the  stress  intensity  factor,  K: 


8(  =  - ^p1/2  +  fi,(v,  E,  n)p3/2  ,  (EQ4) 

Ej 2k 

with  p  being  the  distance  from  the  crack  front  and  the  subscript  i  is  not  a  summation  index  but  rather 
corresponds  to  the  fracture  mode,  i  =  I  or  II. 


Gauss  quadrature  is  used  to  numerically  integrate  the  distribution  of  Green’s  functions  for  the  two- 
dimensional  displacement  discontinuities  in  a  homogeneous,  isotropic,  linear  elastic  material.  For  frictional 
sliding,  the  normal  stress  is  compared  to  the  shear  local  to  each  element.  If  the  local  normal  traction  is 
tensile,  then  the  crack  elements  open  and  are  free  to  slide;  if  the  current  local  normal  traction  is  compressive, 
and  the  shearing  traction  exceeds  the  normal  traction  multiplied  by  a  friction  coefficient,  then  the  element  is 
closed  and  slides  against  a  restraining  shearing  traction.  The  slip  length  is  defined  as  the  length  that  the  crack 
has  traveled  up  the  interface. 


3.2  Experimental  Method 

The  goal  of  this  ongoing  research  effort  is  to  measure  experimentally  how  different  interfacial  surface 
roughness,  coupled  with  variable  friction  and  bonding,  act  to  limit  the  crack  opening  displacement  of  an 
impinging  fracture.  These  measurements  are  compared  with  computational  predictions  derived  from  the 
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two-dimensional  modeling  scheme  described  above.  The  measurements  bolster  confidence  in  using  the 
computational  tools  to  give  design  guidance  for  interfaces.  In  addition,  the  experiments  reveal  the  steady- 
state  angle  at  which  a  three  dimensional  crack  front  propagates  when  it  intersects  a  rough,  frictional 
interface.  This  quantity  is  useful  as  a  parameter  to  embed  into  three  dimensional  computational  simulations 
where  there  are  often  numerical  problems  associated  with  surfaces. 

The  dual  double  cleavage  drilled  compression  (dual-DCDC)  specimens  used  in  this  experiment,  a 
variation  on  the  geometry  used  by  Janssen  (1974),  were  fabricated  from  plate  glass  and  are  shown 
schematically  in  Figure  3  on  page  9.  The  dimensions  of  both  the  drilled  and  undrilled  halves  were  2.54  cm  X 
10.16  cm  X  0.9525  cm.  The  crack  starter  holes,  drilled  in  the  center  of  the  0.9525  cm  X  10.16  cm  faces,  have 
a  diameter  of  0.3175  cm.  A  half  containing  a  hole  was  butted  up  against  a  non-drilled  specimen  with  the 
sides  in  contact  possessing  a  known  roughness.  The  pre-stress  normal  to  the  interface  was  controlled  using  a 
hydraulic  piston  assembly  that  pressed  the  two  specimens  together.  The  dual-DCDCs  were  compressed  in  an 
special  apparatus  designed  specifically  for  these  tests,  which  is  shown  in  Figure  2. 

As  the  specimen  is  compressed,  a  crack  initiates  at  the  drilled  hole  due  to  the  stress  concentration 
located  there.  The  stress  concentration  is  similar  to  that  of  a  remote  stress  applied  to  a  hole  in  an  infinite 
sheet  (Warren,  1987).  Due  to  the  specimen  geometry,  a  stable  crack  can  be  propagated  perpendicular  to  the 
controlled  interface.  Because  of  the  brittle  nature  of  glass,  we  assume  that  local  crack  extension  is  at  a 
constant  fracture  toughness  and  any  variation  in  the  crack  front  shape  as  it  traverses  the  specimen  is  a 
consequence  of  the  roughness  and  friction  at  the  interface. 

While  the  cracks  are  propagating,  the  crack  opening  displacements  are  determined  using  crack  opening 
interferometry  (COI)  (Liecbti,  1994).  COI  is  based  on  the  concept  that  when  light  is  sent  through  two 
opposing  surfaces  the  reflected  beams  interfere  with  one  another  creating  interference  fringes  along  the 
specimen.  These  fringes  can  then  be  used  to  determine  the  amount  that  a  crack  is  opening  or  closing.  One 
can  represent  the  combination  of  the  two  beams  by  adding  the  two  light  vector  magnitudes  together  where 
each  light  vector  is  represented  by 
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Figure  2.  lOOkN  press  on  a  vibration  isolation  table  with  optics  for  COI  measurements. 


E  =  acos(<j>-  cot) 


and  their  combination  is 


Etotal  =  acosQ(<|>i  +  <t>2)-Grt) 

where 


a  =  aj2[l  +  cos(<(>2  -  ((>,)] 

is  the  amplitude  of  the  resulting  wave.  The  phase  angle  difference  is  related  to  the  displacement  8  by 


(EQ5) 


(EQ6) 


(EQ  7) 
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INTERFACE  WITH  VARIABLE 
ROUGHNESS,  FRICTION  & 


Figure  3.  Dual  DCDC  specimen  showing  the  crack  bowing  angle  %  at  the  frictional,  rough 
interface. 


<(»2 -<»i  =  2n ^  (EQ8) 

Substituting  into  the  previous  equation  and  knowing  that  the  intensity  is  proportional  to  the  square  of  the 
amplitude,  one  gets 

/  =  4a2^cos^Jt0j  9) 

It  can  be  seen  from  this  equation  that  the  dark  fringes  occur  at  where  the  intensity  is  a  minimum  (1=0)  and 
the  light  fringes  occur  where  the  intensity  is  at  a  maximum. 


I  =0=^’5  =  (2/I+l)r 

min  v  '2 


I  =$b  =  n\ 

max 


(EQ  10) 
(EQ  11) 
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Thus,  destructive  interference,  or  dark  fringes,  can  be  observed  where 


i,  «X 

n  =  - - 

2n2cosy 

and  constructive  interference,  or  bright  fringes,  can  be  seen  where 


(EQ  12) 


_  (2n-l)X 

4n2cosy  (EQ  13) 

where  n2  is  the  refractive  index  of  the  fluid  between  the  crack  surfaces.  For  these  experiments,  the  fluid  is  air 
and  the  refractive  index  is  equal  to  1.003.  For  a  wavelength  X  the  resolution  is  equal  to  1/4 A. 

The  mode  I  stress  intensity  factor  for  any  fringe  can  then  be  found  from  the  normal  crack  opening 
displacements  by 


k,  =  **_(  *  pn2 

1  16/i2cosyV(1_v2)A  r  ) 


(EQ  14) 


4.0  RESULTS 

4.1  Computational  Method 

To  reveal  the  near-tip  behavior  of  interfacial  sliding,  that  will  not  be  influenced  by  the  finite  geometry  of 
the  matrix  crack,  we  limit  the  absolute  sliding  length  to  be  less  than  8%  of  the  half  crack  length.  This  rule  of 
thumb  was  determined  by  keeping  the  difference  between  the  numerically  determined  stress  field  around  the 
crack  tip  within  5%  of  the  analytical  asymptotic  stress  field.  The  results,  which  are  in  agreement  with  those 
presented  in  Dollar  and  Steif  (1989),  for  the  relationship  between  the  normalized  slip  length  and  the 
Coulomb  coefficient  of  friction  are  shown  in  Figure  4. 


Roughness  is  assumed  to  be  approximated  by  a  triangular  arrangement  of  surface  integral  elements  with 
a  period,  P,  and  amplitude,  A.  Good  correlation  was  found  in  the  results  using  the  same  roughness  angle,  <p, 
defined  as 
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<)>  =  atan  — 


(EQ  15) 


as  long  as  the  period  of  roughness  was  no  more  than  ten  percent  of  the  sliding  length.  One  important  note 
about  rough  interfaces  when  there  is  not  bonding  (i.e.,  no  toughness)  is  that  the  sliding  zones  are  not 
continuous,  closed  sliding,  but  rather  are  interspersed  with  open  and  sticking  portions.  Sample  design 


0  0.2  0.4  0.6  0.8  1 

friction  coefficient,  p 

Figure  4.  Effect  of  friction  on  sliding  length  of  interfacial  toughness. 

surfaces  showing  sliding  lengths  as  a  function  of  toughness,  friction,  and  roughness  are  displayed  in 
Figure  5. 

4.2  Experimental  Method 

All  of  the  drilled  halves  of  the  dual-DCDC  specimens  were  pre-cracked  by  loading  to  1600  kg  with  pop- 
in  occurring  between  1200-1500  kg.  For  the  three  levels  of  roughness  (polished,  300  pm,  1000  pm)  at  a 
constant  friction  coefficient  (glass  on  glass)  and  no  interfacial  toughness,  a  range  of  lateral  compression  was 
applied  (from  0-0.25  MPa).  The  crack  bowed  at  a  much  lower  lateral  compression  for  the  polished 
specimens  than  the  abraded  specimens.  For  all  specimens  that  exhibited  this  crack  pinning  nature,  a  steady 
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Figure  5.  Interfacial  sliding  length  variation  for  combinations  of  interfacial  friction  and 
roughness  for  interfaces  with  no  toughness. 


state  angle  between  the  crack  front  and  the  interface  was  observed.  The  higher  the  roughness,  the  smaller  the 
angle  between  the  crack  front  and  interface  was.  The  angle  is  dependent  on  the  amount  of  lateral 
compression.Crack  kinking  was  also  observed  if  the  specimens  were  put  under  a  Mode  II  as  well  as  a  Mode 
I  loading.  These  specimens  exhibited  growth  out  of  plane  and  a  significant  pinning  at  the  interface. 


25  kPa 

30  kPa 

100  kPa 

250  kPa 

Polished 

63 

62 

58 

300  pm 

66 

46 

1000  pm 

67 

67 

37 

TABLE  1.  Average  crack  angles,  %,  as  a  function  of  the  nominal  normal  pressure  at  the 
interface  and  the  roughness. 
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5.0  CONCLUSIONS 


The  design  criteria  for  sliding  length,  e.g.  Figure  5,  demonstrate  that  designing  for  a  particular  sliding 
length  can  be  virtually  impossible  for  low  friction  Qi<0.01)  and  high  roughness  (>5  degrees)  interfaces.  The 
sliding  length  is  very  sensitive  to  variations  in  small  values  of  friction  coefficient  and  it  is  unreasonable  to 
pinpoint  a  particular  length.  At  high  friction  and  roughness,  the  interfacial  sliding  is  inhibited  for  all  but  the 
lowest  of  interfacial  bonding.  For  moderate  ranges  of  energy  release  rate,  coefficient  of  friction,  and 
roughness  angle,  tailoring  the  sliding  length  is  a  reasonable  objective.  The  most  control  comes  at  moderate 
friction  with  low  roughness. 


The  experiments  are  continuing  to  yield  valuable  information  concerning  the  influence  that  the  interface 
has  on  the  character  of  the  matrix  crack  growth.  The  stable  crack  growth  achieved  in  our  dual-DCDC 
geometry  used  in  the  test  rig  we  have  constructed,  coupled  with  the  optical  measuring  system  we  have  put  in 
place,  will  make  it  practical  to  evaluate  different  interface  types  to  simulate  those  being  considered  in 
practical  ceramic  matrix  composites.  This  information  will  also  be  useful  in  guiding  the  development  of 
three  dimensional  computer  models  for  near-interface  cracks. 
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OF  GAIN  SCHEDULED  MISSILE  AUTOPILOTS 


Douglas  A.  Lawrence 
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School  of  Electrical  Engineering 
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Abstract 

The  development  of  computational  and  analytical  tools  for  the  analysis  and  design  of  gain  scheduled 
missile  autopilots  is  summarized.  Matlab ™  routines  for  general-purpose  multivariate  splining  and 
numerical  solution  of  a  class  of  partial  differential  equation  are  described.  The  splining  routines  are  used 
to  create  a  Simulink ™  simulation  and  a  routine  to  extract  linearized  models  for  a  nonlinear  six  degree-of- 
freedom  air-to-air  missile  model.  Also,  a  stability  analysis  is  presented  that  lends  justification  to  the 
potentially  hazardous  yet  commonly  practiced  technique  of  designing  linear  autopilots  on  the  basis  of 
linearizations  about  non-equilibrium  conditions. 
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TOOLS  FOR  THE  ANALYSIS  AND  DESIGN 
OF  GAIN  SCHEDULED  MISSILE  AUTOPILOTS 


Douglas  A.  Lawrence 


1.  Introduction 

Gain  scheduling  techniques  originated  in  the  design  of  flight  control  systems.  Since  the 
aerodynamic  characteristics  of  an  aircraft  or  missile  can  vary  drastically  over  a  specified  flight  envelope,  a 
linear  controller  designed  on  the  basis  of  a  linearized  aerodynamic  model  about  a  single  flight  condition 
will  typically  fail  to  deliver  adequate  global  performance.  In  practice,  this  difficulty  is  remedied  by 
performing  linear  controller  designs  at  a  set  of  flight  conditions  that  spans  the  flight  envelope  and 
captures,  in  some  sense,  the  nonlinear  aerodynamic  effects.  The  parameters,  often  simply  gains,  in  the 
resulting  set  of  linear  controllers  are  then  interpolated  or  scheduled  as  a  function  of  a  selected  set  of 
scheduling  variables. 

Recently,  gain  scheduling  has  received  a  great  deal  of  attention  in  the  literature  as  a  formal 
control  system  design  method.  The  PI  has  collaborated  on  the  development  of  a  particular  approach  to 
gain  scheduling  that  emphasizes  nonlinear  systems  aspects  and  is  described  in  broad  terms  by  the  follows 
steps:  [6] 

•  determine  a  nonlinear  plant’s  family  of  constant  operating  points  or  equilibria, 
parameterized  by  a  designated  set  of  scheduling  variables; 

•  for  the  corresponding  family  of  linearized  plants,  design  a  family  of  linear  controllers  to 
meet  prescribed  design  goals  at  each  constant  operating  point; 

•  construct  a  gain  scheduled  controller  that  linearizes  to  the  appropriate  linear  controller  at 
each  constant  operating  point; 

•  check  nonlocal  performance  of  the  resulting  nonlinear  control  system. 

The  emphasis  in  this  approach  is  on  the  linearization  requirement  imposed  on  the  gain  scheduled 
controller  in  the  third  step.  Specifically,  technical  conditions  have  been  derived  for  the  existence  of  a  gain 
scheduled  controller  with  this  property.  Failure  to  satisfy  these  conditions  results  in  the  introduction  of 
so-called  hidden  coupling  terms  that  can  potentially  degrade  system  performance. 

The  Pi’s  participation  in  the  1996  AFOSR  Summer  Faculty  Research  Program  involved  the 
application  of  this  gain  scheduling  methodology  to  the  design  of  a  pitch  channel  autopilot  for  a 
hypothetical  air-to-air  missile  model.  [5]  The  work  described  in  this  report  builds  on  these  efforts  and  is 
divided  into  two  main  activities: 

•  develop  practical  tools  for  applying  this  gain  scheduling  methodology  to  missile  autopilot 
synthesis; 
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•  analyze  the  stability  mechanisms  at  work  in  existing  missile  autopilot  implementations. 

Each  of  these  topics  is  described  in  detail  in  the  following  two  sections.  The  final  section  contains  some 
concluding  remarks. 

2.  Tools  for  Gain  Scheduled  Autopilot  Synthesis 

In  general,  the  following  computations  must  be  performed  in  order  to  implement  the  gain 
scheduling  methodology  outlined  above: 

•  solve  systems  of  nonlinear  algebraic  equations  in  order  to  characterize  the  plant’s  family  of 
constant  operating  points; 

•  calculate  partial  derivatives  in  order  to  form  the  family  of  linearized  plant  models; 

•  perform  parameterized  linear  controller  design  based  on  the  family  of  linearized  plant 
models; 

•  solve  partial  differential  equations  to  construct  a  gain  scheduled  controller  that  meets  the 
linearization  requirement  described  above. 

In  realistic  problems,  these  steps  can  rarely  be  carried  out  analytically.  This  is  the  case  for  missile 
autopilot  design  problems  as  even  though  the  equations  of  motion  for  a  missile  have  an  analytical 
structure  in  terms  of  the  various  aerodynamic  coefficients,  the  coefficients  themselves  are  usually 
represented  in  tabular  form.  At  the  outset,  this  necessitates  a  numerical  approach  to  the  above 
computations.  Towards  developing  a  set  of  computational  tools  that  can  be  systematically  applied  in  a 
general  missile  autopilot  design  setting,  Matlab ™  code  has  been  developed  to  perform  the  following  tasks: 

•  construct  multivariate  splines  from  tabular  data; 

•  evaluate  partial  derivatives  of  these  multivariate  splines; 

•  generate  approximate  solutions  to  a  type  of  partial  differential  equation  that  arises  in  the 
implementation  gain  scheduled  controllers  that  satisfy  the  previously  described  linearization 
requirements. 

In  addition,  these  general  purpose  routines  have  been  used  to  construct  a  Simulink ™  simulation  of  a  six 
degree-of-freedom  air-to-air  missile  model  provided  by  the  Navigation  and  Control  Branch,  Armament 
Directorate,  Wright  Laboratory  (WL/MNAG),  Eglin  Air  Force  Base.  Also  for  this  missile  model,  a 
Matlab ™  routine  has  been  developed  that  calculates  a  linearized  model  about  a  user-selected  operating 
point.  In  the  remainder  of  this  section,  an  overview  of  these  general  purpose  Matlab ™  routines  along 
with  the  Simulink ™  missile  simulation  and  linearization  routine  is  presented. 

Bivariate  splining  can  be  implemented  in  the  Matlab ™  workspace  using  the  bsjnake  and 
bs  evalp  commands  that  correspond  to  function  m-files  of  the  same  name.  The  bs  make  command  is 
used  to  create  a  bivariate  spline  from  a  two-dimensional  data  table.  The  resulting  spline  is  evaluated 
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using  the  bsevalp  command.  Partial  derivatives  through  order  three  in  each  variable  can  also  be 
evaluated  by  specifying  optional  arguments.  On-line  help  messages  describing  proper  syntax  for  each  of 
these  commands  are  shown  in  Figures  2.1  and  2.2. 


»  help  bs_make 


Make  a  bivariate  spline  matrix  to  be  used  with  bs_evalp.m 

Usage:  [a]=bs_make(x,y,z) 

Inputs:  x-  vector  of  break  points  in  x  variable 
y  -  vector  of  break  points  in  y  variable 
z  -  matrix  of  function  values  with  rows  corresponding  to 
different  values  of  x,  columns  corresponding  to  different 
values  of  y 


Output:  a  -  matrix  containing  spline  data  to  be  used  with  bs  evalp.m 

Steven  Andrew 
10-16-97  Version  1.2 


Fig.  2.1  :  Matlab  on-line  help  message  for  the  bsmake  command 


»  help  bs  evalp 


Evaluate  a  bivariate  spline  at  a  point  (x,y) 
Usage:  [f|=bs_evalp(a,x,y[,x_der,y_der]) 


Inputs:  a  -  bivariate  spline  matrix  created  using  bs_make.m 
x  -  x  coordinate  of  evaluation  point 
y  -  y  coordinate  of  evaluation  point 
x_der  -  partial  order  (1,  2,  or  3)  w.r.t  x  variable  (optional) 
y_der  -  partial  order  (1,  2,  or  3)  w.r.t.  y  variable  (optional) 

Output:  f  -  spline  value  at  point  (x,y)  or  partial  at  (x,y) 
when  used  with  x  der  and  y_der 

Steven  Andrew 

2-17-97  Version  1.1 


Fig.  2.2:  Matlab  on-line  help  message  for  the  bs  evalp  command 


In  an  analogous  fashion,  trivariate  splining  can  be  implemented  in  the  Matlab ™  workspace  using 
the  tsjnake  and  tsjevalp  commands.  The  tsmake  command  is  used  to  create  a  trivariate  spline  from  a 
three-dimensional  data  table  organized  into  a  two-dimensional  array  in  a  particular  fashion.  The  resulting 
spline  is  evaluated  using  the  tsevalp  command.  Partial  derivatives  through  order  three  in  each  variable 
can  also  be  evaluated  by  specifying  optional  arguments.  On-line  help  messages  describing  proper  syntax 
for  each  of  these  commands  are  shown  in  Figures  2.3  and  2.4. 


»  help  ts  make 


Make  a  trivariate  spline  matrix  to  be  used  with  ts_evalp.m 

Usage:  [a]=ts_make(x,y,z,f) 

Inputs:  x  -  vector  of  break  points  in  x  variable 
y  -  vector  of  break  points  in  y  variable 
z  -  vector  of  break  points  in  z  variable 

f  -  matrix  of  data  points  with  initial  n  rows  corresponding  to 
different  values  of  x  and  m  columns  corresponding  to  different 
values  of  y.  Values  of  z  are  represented  by  o  nxm  blocks 
stacked  in  a  column  matrix  -  forming  a  matrix  with 
(n  x  o)  rows  and  m  columns 

Output:  a  -  matrix  containing  spline  data  to  be  used  with  tsevalp.m 

Steven  Andrew 

10-16-97  Version  1.2 


Fig.  2.3  :  Matlab  on-line  help  message  for  the  ts  make  command 


The  implementation  of  gain  scheduled  controllers  that  satisfy  the  previously  described 
linearization  requirement  often  involves  functions  of  the  scheduling  variables  that  are  solutions  to  a 
certain  type  of  partial  differential  equation.  Specifically,  with  ©  a  ^-dimensional  vector  of  scheduling 

variables,  a  function  x.  Rq  ->  R  is  sought  that  satisfies 

J^-(©)  =  G(©)  (2.1) 

where  ax/50  =  |ar/ 50!  •••  Sr/SG,]  is  the  gradient  of  x  and  G(©)  =  ^G1(0)  •••  Gq (0)|  has  been 

determined  from  the  preceding  steps  in  the  gain  scheduling  design  process.  A  solution  to  (2.1)  exists  if 
and  only  if  G(©)  satisfies  the  mixed-partial  conditions 
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»  help  ts_evalp 

Evaluate  a  trivariate  spline  at  a  point  (x,y,z) 

Usage:  [f]=ts_evalp(a,x,y,z[,x_der,y_der,z_der]) 

Inputs:  a  -  trivariate  spline  matrix  created  using  ts  make.m 
x  -  x  coordinate  of  evaluation  point 
y  -  y  coordinate  of  evaluation  point 
z  -  z  coordinate  of  evaluation  point 
x_der  -  partial  order  (1, 2,  or  3)  w.r.t.  x  variable  (optional) 
y  der  -  partial  order  (1,  2,  or  3)  w.r.t  y  variable  (optional) 
z_der  -  partial  order  (1, 2,  or  3)  w.r.t.  z  variable  (optional) 

Output:  f  -  spline  value  at  point  (x,y,z)  or  partial  at  (x,y,z) 
when  used  with  x_der,  y_der,  and  z_der 

Steven  Andrew 

2-17-97  Version  1.1 

» _ 

Fig.  2.4  :  Matlab  on-line  help  message  for  the  ts  evalp  command 


dGj  &Gj 

O&j  O&i 


(2.2) 


Since  typically  G(©)  will  not  satisfy  these  restrictive  conditions,  an  exact  solution  to  (2.1)  will  not  exist 


and  it  is  therefore  of  interest  to  pursue  approximate  solutions. 

One  approach  is  to  use  techniques  from  finite  element  analysis,  in  particular,  collocation 
methods.  This  will  next  be  described  for  the  case  q-  2 .  Let 


r  =  {©=(01,02)|©r^0^©r,e?^02^er} 


denote  a  rectangular  domain  in  R 2  on  which  an  approximate  solution  to  (2.1)  is  desired.  A  strictly 
increasing  sequence  in  each  variable,  ©,mm  =©,■  <•••<©”'  =©“■*,  1  =  1,2,  defines  a  grid  on  the 
domain  T . 

The  goal  is  to  construct  a  bivariate  spline,  denoted  i(©j,©2) ,  that  has  the  required  partial 
derivative  values  at  each  grid  point.  That  is, 

■J-tei.O*)  =  G*(©i,©'),  i  =  l . Hi,  y  =  l...,n2,  *  =  1,2  (2.3) 

Using  a  bivariate  spline  that  is  cubic  in  each  variable  also  allows  i(©j  ,©2)  to  have  continuous  second 
order  partial  derivatives.  Further,  spline  values  at  any  two  grid  points  may  be  freely  selected. 
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Calculating  and  subsequently  evaluating  a  bivariate  spline  with  these  properties  can  be 
accomplished  in  the  Matlab ™  workspace  using  the  commands  fem2d  and  eval2d.  On-line  help  messages 
describing  proper  syntax  for  each  of  these  commands  are  shown  in  Figures  2.5  and  2.6. 


»help  fem2d 


Implement  2-dimensional  finite  element  method 
to  construct  a  bivariate  spline  that  represents 
an  approximate  solution  to  a  type  of  partial 
differential  equation 

Usage:  [a]  =  fem2d(x,  y,  p,  q,  constl,  const2) 

Inputs:  x,  y  -  vectors  containing  grid  values  in  each  variable 
p  -  partial  derivative  values  w.r.t.  x  at  grids  points 

q  -  partial  derivative  values  w.r.t.  y  at  grids  points 

constl  -  spline  value  at  (x(l),  y(l» 
const2  -  spline  value  at  (x(n),  y(l)) 

Output:  a  -  matrix  containing  spline  data  to  be  used  with  eva!2d.m 

HaoyuLai 

11-5-97 


Fig  2.5:  Matlab  on-line  help  message  for  the  fem2d  command 


»  help  eval2d 


Evaluate  the  bivariate  spline  constructed  by  fem2d.m  at  a  point  (xe,  ye) 
Usage:  [val]  =  eval2d(a,  x,  y,  xe,  ye) 

Inputs:  a  -  matrix  containing  spline  data  generated  by  fem2d.m 
x,  y  -  vectors  containing  grid  values  in  each  variable 
xe,  ye  -  coordinates  of  evaluation  point 

Output:  val  -  value  of  the  spline  at  the  point  (xe,  ye) 

HaoyuLai 

7-25-97 


Fig  2.6:  Matlab  on-line  help  message  for  the  eval2d  command 
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One  of  the  objectives  of  this  work  was  to  convert  the  previously  mentioned  six  degree-of-freedom 
air-to-air  missile  simulation  from  Fortran  code  to  a  Simulink  model  in  which  the  aerodynamic 
coefficients  are  implemented  using  bivariate  and  trivariate  splines.  The  top  level  Sunulink ™  block 
diagram  for  the  simulation  is  shown  in  Figure  2.7.  The  aerodynamics  are  divided  into  two  blocks  labeled 
Forces/Moments  and  Rigid  Body  EOM.  The  Forces/Moments  subsystem  block  has  two  vector  inputs  and 
one  vector  output.  The  first  (top)  input  is  a  three-dimensional  vector  consisting  of  the  effective  pitch,  roll, 
and  yaw  control  surface  deflections,  all  in  radians.  The  second  (bottom)  input  is  a  seven-dimensional 
vector  consisting  of  angle-of-attack  (radians),  sideslip  (radians),  missile  velocity  (meters/sec),  roll,  pitch, 
and  yaw  body  rates  all  in  radians/sec,  and  altitude  in  meters.  The  output  is  a  six  dimensional  vector 
containing  the  three  components  of  the  aerodynamic  force  vector  (Newtons)  and  the  three  components  of 
the  aerodynamic  moment  vector  (Newton-meters),  all  relative  to  the  standard  body-fixed  coordinate 
system.  The  Rigid  Body  EOM  s-fimction  block  has  a  vector  input  corresponding  to  the  output  of  the 
Forces/Moments  subsystem  block  and  a  vector  output  corresponding  to  the  second  (bottom)  input  of  the 
Forces/Moments  subsystem  block.  The  Rigid  Body  EOM  s-function  block  implements  the  standard 
differential  equations  describing  the  motion  of  a  rigid  body  under  the  influence  of  external  forces  and 
moments.  Gravitational  forces  are  calculated  within  this  block  so  it  is  not  necessary  to  pass  the  Euler 
angles  to  the  Forces/Moments  subsystem  block. 


Rigid  Body 
EOM 


Six  Degree-of-Freedom  Air-to-Air  Missile  Simulation 


Fig.  2.7:  Simulink  top-level  block  diagram  for  six  degree-of-freedom  air-to-air  missile  simulation 
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Simulation  results  are  presented  for  the  initial  conditions  given  in  Table  2. 1  and  a  doublet  pulse 
described  by 

0  rad  0  <,  t  <  0.1  sec 
-10  -jig  rad  0.1  <>t<  0.2  sec 
IQ-^rad  0.2  <,t  <0.3  sec 
0  rad  t  £  0.3  sec 

applied  sequentially  to  each  of  the  three  control  surface  deflections,  with  the  other  two  fixed  at  zero 
deflection.  Responses  for  a  doublet  pitch  fin  deflection  are  shown  in  Figures  2.8  through  2.10.  Responses 
for  a  doublet  roll  fin  deflection  are  shown  in  Figures  2.11  through  2.13.  Responses  for  a  doublet  yaw  fin 
deflection  are  shown  in  Figures  2.14  through  2.16.  As  an  indication  of  the  execution  speed  of  the 
simulation,  one  second  of  simulation  time  on  a  75  MHz  Pentium  PC  with  16  MB  RAM  running  Matlab 
4.2c  and  Simulink  2.0  took  slightly  less  than  two  minutes  for  each  of  the  three  simulation  scenarios. 


Table  2.1:  Initial  Condition  Values  for  Simulink  Simulations 


Variable 

Initial  Value  Units 

angle-of-attack 

0 

radians 

sideslip  angle 

0 

radians 

missile  velocity 

500 

meters  per  second 

roll  angle 

0 

radians 

pitch  angle 

0 

radians 

yaw  angle 

0 

radians 

roll  rate 

0 

radians  per  second 

pitch  rate 

0 

radians  per  second 

yaw  rate 

0 

radians  per  second 

altitude 

10,000 

meters 
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Fig  2.8:  Angle-of-attack  a  (top),  side-slip  p  (middle),  and  velocity  (bottom)  vs.  time  for  doublet  pitch 

fin  deflection 
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xIO  * 


Fig  2.9:  Roll  rate  P  (top),  pitch  rate  Q  (middle),  and  yaw  rate  R  (bottom)  vs.  tir 

deflection 
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THETA  (degrees) 


Velocity  (meters/sec) 


0.2 


0  0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1 


Time  (sec) 

Fig  2.11:  Angle-of-attack  a  (top),  side-slip  p  (middle),  and  velocity  (bottom)  vs.  time  for  doublet  roll 

fin  deflection 


1 


R  (degrees/sec)  Q  (degrees/sec)  P  (degrees/sec) 


0.5  0.6  0.7  0.8  0.9  1 


(bottom)  vs.  time  for  doublet  roll  fin  deflection 
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Velocity  (meters/sec)  Beta  (degrees) 


6 


Time  (sec) 


Time  (sec) 


Fig  2.14:  Angle-of-attack  a  (top),  side-slip  (3  (middle),  and  velocity  (bottom)  vs.  time  for  doublet  yaw 

tin  deflection 
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R  (degrees/aec) 
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THETA  (degrees) 


25 


-0.5 


Time  (sec) 


Time  (sec) 


Fig  2.16:  Euler  angles  4>  (top),  0  (middle),  and  vy  (bottom)  vs.  time  for  doublet  pitch  yaw  deflection 
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It  is  also  of  interest  to  have  the  capability  to  obtain  linearized  models  of  a  missile’s  nonlinear 
aerodynamics  about  user-selected  operating  points.  These  models  serve  as  the  starting  point  for  the  linear 
design  phase  of  the  overall  gain  scheduling  design  process.  For  the  air-to-air  missile  model  considered 
here,  linearized  models  can  be  obtained  in  the  Matlab ™  workspace  using  the  jacob  command.  The  on¬ 
line  help  message  describing  proper  syntax  for  this  command  are  shown  in  Figure  2. 17. 


»  help  jacob 


This  function  calculates  state-space  coefficient  matrices 
for  a  linearization  of  the  six  degree -of-freedom  air-to-air 
missile  model  at  a  user-selected  operating  point  (x,u)  where 

x  =  [alpha  beta  vt  phi  theta  psi  P  Q  R  h]* 
u  =  [delp  delr  defy]' 

The  linearized  model  corresponds  to  state  and  input  vectors 
specified  above  and  an  output  vector  containing  total  accelerations 
(including  gravitational  effects)  along  body-fixed  x,  y,  and  z-axes. 

Usage:  [a,b,c,d]  =  jacob(x,u) 

Inputs:  x,u  :  state  and  input  vectors  specifying  the  operating 

point  values  of  the  variables  indicated  above 

Outputs:  a,b,c,d  :  coefficient  matrices  of  the  linearized  state  equation 
(a  is  10  x  10,  b  is  10  x  3,  c  is  3  x  10,  d  is  3  x  3) 


Steven  Andrew 
12-16-97  Version  1.2 


Fig  2.17:  Matlab  on-line  help  message  for  the  jacob  command 


As  an  example  of  jacob  command  usage,  consider  the  operating  point  for  which  x  corresponds  to 
the  initial  condition  values  listed  in  Table  2. 1  and  u  corresponds  to  zero  control  surface  deflections.  The 
dimensions  of  the  coefficient  matrices  returned  by  the  jacob  command  preclude  the  display  of  these 
matrices.  However,  by  dividing  the  ten  state  variables  into  longitudinal  and  lateral  components  according 
to 
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xlong 


a* 

Q 

P 

y, 

>  xlat  ~ 

R 

e 

* 

h  _ 

y. 

xlong 
.  xlat 


the  A  and  B  and  matrices  have  the  form 


■A lat-long 

,  B 

Blong 

0  ' 

1 

! 

o 

_ i 

0 

Biat, 

where,  for  example. 


'-0.2402 

1 

0 

0 

0 

'-0.2006' 

-62.31 

-0.2836 

0 

0 

0 

-140.7 

9.256 

0 

-0.0432 

-9.8 

0.0015 

f 

II 

0 

0 

1 

0 

0 

0 

0 

-500 

0 

0 

500 

0 

0 

In  addition,  transfer  functions  corresponding  to  selected  input-output  pairs  can  be  calculated.  Denoting 
the  effective  roll,  pitch,  and  yaw  control  surface  deflections  by  Sr,8p,  and  5^,  respectively,  and  the  roll. 


pitch  and  yaw  body  rates  by  P,  Q,  and  R,  respectively,  the  following  is  a  representative  sample: 


Q 


<*) 


R  <  \ 
T(s) 

5  y 


1,283 

s+21.4 

-140.7  s(s  -  0.00379)(s + 0,052 5)(s + 0.146) 

(s  -  0.0 1 88)(s + 0.03 1  ± y0.0242)(x + 0.262  ±  y  7.89) ' 
-140,7 
s+ 0.284 


3.  Stability  Analysis  of  Gain  Scheduled  Missile  Autopilots 

The  gain  scheduling  methodology  described  earlier  assumes  that  a  family  of  linearized  plants  has 
been  computed  about  a  family  of  constant  operating  points  or  equilibria.  The  objective  is  then  to  construct 
a  nonlinear  controller  with  the  property  that  the  resulting  nonlinear  closed-loop  system  has  a  prescribed 
family  of  linearizations  about  its  constant  operating  point  family  corresponding  to  constant  exogenous 
signals.  Once  this  has  been  achieved,  the  nonlinear  closed-loop  system  will  exhibit  bounded  behavior  for 
non-constant  but  sufficiently  slowly  varying  exogenous  signals. 

This  underlying  assumption  is  routinely  violated  when  gain  scheduling  is  employed  in  the  design 
of  longitudinal  missile  autopilots.  Specifically,  the  starting  point  for  the  design  is  the  short-period 
aerodynamics  described  by 
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a  = 


qS 


m 


[CN  cos(a)  -  CA  sin(a)]  +  gcos(0-a) 


+  Q 


Q  =  2*Lt 

h 


'M 


(3.1) 


where  a  is  angle-of-attack,  Q  is  pitch  rate,  0  is  pitch  attitude,  Vm  is  missile  speed,  q  is  dynamic 
pressure,  and  C  4 ,  are  axial  force,  normal  force,  and  pitching  moment  aerodynamic 

coefficients,  respectively.  Not  explicitly  shown  in  these  equations  is  the  dependence  of  the  aerodynamic 
coefficients  on  the  elevator  deflection,  8 .  A  control  surface  actuator  that  produces  an  elevator  deflection 
in  response  to  a  commanded  value  8C  is  often  modeled  by  a  linear  system  of  the  form 


d 

8 

0 

1 

8' 

roi 

dt 

5 

.-®2 

-2C 

h 

+ 

e 

O  K) 

1 _ 

Normal  acceleration  is  usually  the  variable  to  be  controlled  and  is  given  by 

qS 

r\z  =  — CN  +gcos(0) 
m 


(3.2) 


(3.3) 


The  gain  scheduling  methodology  applied  to  the  short  period  aerodynamics  and  the  actuator 
dynamics  yields  a  nonlinear  controller  of  the  general  form 
xc  =  a(xc,r\c,  ym,@) 

Sc  =  c(xc,i\c,ym,&) 


(3-4) 


where  Xq  is  the  controller  state,  t|c  is  commanded  normal  acceleration,  ym  is  a  vector  of  measurements 


available  to  the  controller,  and  ©  denotes  the  vector  of  scheduling  variables.  The  closed-loop  system 
formed  by  the  interconnection  of  (3.1),  (3.2),  and  (3.4)  can  be  cast  in  the  general  form 


where 


= 

ya  = 


x„  = 


fa (^o * ) 

(^a  ) 


(3.5) 


a 

Q 

vm~ 

h 

a 

8 

Ua  = 

e 

ya  = 

Q 

8 

5 

*C. 

Jlc. 

(3.6) 


Although  not  shown  explicitly,  the  dependence  on  altitude,  h,  comes  from  several  sources  including 
altitude  dependence  of  air  density  and  hence  dynamic  pressure,  possible  dependence  of  the  aerodynamic 
coefficients  on  altitude  and  mach  number,  and  altitude  may  be  one  of  the  scheduling  variables. 

As  a  result  of  the  gain  scheduling  design  process,  (3.5)  should  possess  a  family  of  exponentially 
stable  constant  operating  points  that  can  be  parameterized  by  constant  values  of  the  input  ua .  Then,  if 

ua  is  a  bounded  and  sufficiently  slowly  varying  exogenous  input  signal,  the  resulting  state  and  output 
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trajectories  will  remain  bounded.  However,  in  this  situation  the  input  ua  to  (3.5)  is  not  composed  of 
exogenous  signals  but  instead  consists  of  quantities  that  also  govern  the  longitudinal  aerodynamic 
behavior.  Specifically,  Vm ,  0 ,  and  h  evolve  according  to 

Vm  =  —  [CA  cos(a)  +  CN  sin(a)]  -  g  sin(8  -  a) 
tn  1 

0  =  Q  (3.7) 

h  =  Vm  sin(0  -  a) 


and  assuming  constant  normal  acceleration  commands, 
fic  =  0  • 

The  dynamics  (3.7),  (3.8)  can  also  be  represented  in  the  general  form 

_  .  xb  =  fb(Xb,Ub) 

b'  yb  =  hb(xb) 


(3.8) 

(3.9) 


The  definitions  (3.6)  and  (3.10)  imply  the  following  interconnection  of  the  subsystems  and  Y.b : 


Fig.  3.1:  Feedback  interconnection 


The  existing  stability  analysis  that  establishes  the  boundedness  property  of  xa  and  ya  for  a  slowly- 
varying  exogenous  input  ua  cannot  be  directly  applied  in  this  situation.  A  simple  example  illustrates  that 
the  coupling  firom  Za  to  can  have  a  destabilizing  effect.  Suppose  the  subsystem  is  the  bounded- 
input,  bounded-output  stable  linear  system  given  by 

xa  ~  ~xa  +ua 

ya  =  xa 

and  the  subsystem  T,b  is  the  marginally  stable  linear  system 
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*b  =  e 
yb  =  xb 

where  the  parameter  e  characterizes  the  coupling  from  T,a  to  I6.  When  e  =  0,  there  is  no  such 
coupling  and  can  be  viewed  as  an  exosystem  that  generates  constant  input  signals  for  Ea . 
Consequently,  the  corresponding  response  ya  =  xa  will  be  bounded.  For  e  *  0 ,  the  interconnected 
system  is  exponentially  stable  for  e  negative  and  unstable  for  e  positive,  regardless  of  how  small. 

In  order  to  develop  a  stability  analysis  for  the  general  class  of  systems  at  hand  in  which  the 
previously  mentioned  slowly  varying  results  can  be  incorporated,  a  modification  to  the  basic  set-up 
depicted  in  Figure  3.1  is  first  described.  Then,  standard  notions  of  input-output  stability  will  be  employed 
in  order  to  invoke  a  version  of  the  well-known  small-gain  theorem  [1]  [3]  [8]  for  this  modified  system. 

First,  it  will  be  assumed  that  the  subsystem  of  the  form  (3.5)  has  a  family  of  exponentially 
stable  operating  points  that  can  be  parameterized  by  constant  values  of  the  input  ua .  Specifically,  it  will 
be  assumed  that  there  is  a  continuously  differentiable  map  xa°(a)  defined  on  a  compact  set  T  containing 
the  origin  that  satisfies 

/a(*a°(“)>“)  =  Oforallaer. 

Further,  it  will  be  assumed  that  there  is  a  positive  constant  X  such  that  the  eigenvalues  of 

■T~(*a°(a),<x) 

OXq 

have  real  part  no  greater  than  -X  for  all  a  e  T .  Under  these  assumptions,  it  has  been  shown  [2],  [4],  [7] 
that,  for  some  positive  constant  ra,  there  exists  a  continuously  differentiable  map  V(z,  a)  defined  on 
Br  xT  that  satisfies 


cilNI2  ^  v(z*a)  ^  °2  IHI' 


dV 


(z>  a)/fl  (z  +  *a°(a),a)^-c3  ||z|j 


\\dV 

I  dz 

II  dV 

I  doc 


(r,a)|^c4|(z|| 

(z,a)Jsc5||z||2 


for  all  z  eBra, a  eT  for  positive  constants  cl  through  c5 . 

Related  to  the  subsystem  la  is  the  following  subsystem  to  which  the  previously  mentioned 
slowly  varying  analysis  will  be  applied. 

za  fa  (zo  i  ^  a  » ^ a  ) 

•  ®*a  va 

ya  ~  ha(za,CLa) 
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where  za  =  xa -xa°(<xa). 


~  3Xa° 

fa^a *aa»va)  =  fa^a^^a  (aj)»aa)  va 

and 

ha(za,aa)  =  ha(za+xa0(aa)). 

Note  that  in  addition  to  a  change  of  state  coordinates,  Sfl  results  from  cascading  integrators  on  the  input 

channels  of  la  .  Consequently,  if  the  outputs  of  I*  are  differentiated  to  yield  the  subsystem 

~  *b  =  fb{xb,ub) 

b'  wb  =  hb(xb,ub) 

where  wb  =  yb  and  hb (xb ,ub)  =  [dhb /8xb ](jc6 )fb (xb , ub ) ,  the  feedback  interconnection  of  I a  and  fb 
is  equivalent  as  far  as  the  output  ya  is  concerned  provided  the  newly  introduced  integrators  are  initialized 
properly.  Next,  the  following  notion  of  input-output  stability  is  of  interest. 

Definition.  A  system  I  with  input  u  and  output  y  is  said  to  be  small-signal,  finite-gain  -stable  if 
there  is  a  positive  constant  r  and  nonnegative  constants  y  and  p  such  that 

IHL^rHL+P 

for  all  u  e  Lx  with  flails  5  r . 

□ 

The  well-known  small-gain  theorem  can  be  specialized  to  the  feedback  interconnection  of  and  T.b  to 
yield  the  following  result. 

Theorem.  Suppose  the  subsystems  Ea  and  ~Lb  are  each  small-signal,  finite-gain  Lx  -stable.  That  is, 
there  is  a  positive  constant  r  and  nonnegative  constants  Ya>Pa>Yfc>  and  P*  such  that 

IkL  ^yJKIL  +P  * fora11  v*  ^  IhIL  *r 

and 

IhIL  -^IhlL  foral,  ub  w**  IhIL 

Then,  if  YaY*  <  1  there  holds 

*  l-YflYfr 

□  □□ 

Next,  the  desired  stability  property  of  the  subsystem  £a  is  established. 
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Theorem.  Under  the  above  assumptions  on  the  subsystem  S0 ,  if  ||za  (0)||  <  (Cl  /c2 )1/2ra  and  aa (0)  and 
va (t)  are  such  that  <xa(t)  eT  for  all  t  £  0 ,  then  is  small-signal,  finite-gain  Lm -stable. 

Proof.  The  time-derivative  of  V(z,a) ,  introduced  above,  along  trajectories  of  satisfies 

dV  ~  dV 

V(za,aa)  =  —(za,aa)fa(za,a.a,va)+—(za,<xa)va 
az  da 

~  ( za>aa )  fa(za  +  xa° (aa)>aa)  ^~(aa)va  +  (za  >aa)vo 

*  -c3|Kf+  |^^.aa)-^-(^a,aa)-^-(aa)  va 


+[csKI2  +c4^kll]kll 


Vzfl  eSffl,a0  er 


where,  from  previous  assumptions, 


If,  for  0  e  (0,1) , 


Ka  =  maxaer  h^-(a)  <°°. 


Kl.sO-e>S. 

c5 


*  -telK  |2  +CAKa\za\ 

_a^3  ir,  „  ,  c4 


<,  0  V(za,aa)+  — Ka  ||v  a  ||^/l^(z0  ,aa] 

c2  VC1 

Letting  fV(t)  =  Jv(za  (r),aa(f))  it  can  be  shown  that 


D+fV(t)  =  limsup  , 

/i— >0 


fV(t+h)-W(t) 

h 


from  which  the  comparison  lemma  [3,  Lemma  2.5]  yields 

0  c->  0ci 


H'(/)  S  e  zc2  fF(0)  +  -^/t 
2^ci  o 


IM-#* 


which  in  turn  gives 


IM'>||  ^  262  IKWll+^-Je  2e*°  T)|K  (t)|| dx 

V  ci  Zci  o 


The  bounds 


¥a  (0)|!  < 


ra  and  Ik  II  < 


c2c4AT£ 


-ra  ensure  that  \\za  (f)[|  <  ra ,  V  r  s  0  so  that  the 


preceding  analysis  is  valid  from  which 
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□  □□ 

Returning  to  the  longitudinal  missile  system,  the  subsystem  Za  corresponding  to  (3.1),  (3.2), 
(3.3),  and  (3.4)  will  posses  the  input-output  stability  property  established  in  the  preceding  theorem  by 
virtue  of  the  gain  scheduling  design  process.  Input-output  stability  of  the  subsystem  E  ^  corresponding  to 
(3.7)  and  (3.8)  will  now  be  addressed.  First,  consider  the  energy  function 
E(Vm,h)  =  ±mV%+mgh. 

The  time  derivative  of  E  along  trajectories  of  (3.7)  is  given  by 

E(Vm,h)  =  m  VmVm+rngh 
=  -VmFD 

where  Fd  =  qSCD  is  the  drag  force  with  CD  the  drag  coefficient  related  to  axial  and  normal  force 
coefficients  via 

CD  =  -  [CA  cos(a)  +  CN  sin(a)] . 

Since  the  drag  force  is  always  positive  for  a  nonzero  velocity, 

E(ym(t),h(t))<E(Vm(0),hm,  t*  0 

for  nonzero  initial  velocity  and  altitude.  It  can  be  concluded  that  that  Vm(t)  and  h(t)  are  bounded  for 
and  so  the  same  is  true  for  dynamic  pressure.  This  indicates  h(t)  is  bounded  for  V2. 0 and,  under 
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the  realistic  assumption  that  the  drag  coefficient  above  can  be  bounded  uniformly  in  angle-of-attack,  pitch 
rate,  and  elevator  deflection,  boundedness  of  dynamic  pressure  leads  to  boundedness  of  Vm(t)  for  t  £  0 . 
This  gives,  using  (3.7)  and  (3.8), 


K(  0 
0(0 
m 


£ 


<x(0 

Q(  0 
8(0 


+  Pt » 


0 


for  finite  p6.  Finally,  with  wb  =  {vm  Q  h  and  ub  as  in  (3.10),  there  holds 

IKIL^YiKL+Pfc 

with  as  in  the  previous  inequality  and  74=1. 

The  preceding  analysis  provides  an  explanation  for  the  performance  exhibited  by  longitudinal 
gain  scheduled  missile  autopilots  designed  on  the  basis  of  linearizations  about  non-equilibrium  conditions. 
The  complete  longitudinal  dynamics  can  be  cast  as  a  feedback  interconnection  of  two  subsystems  (£0  and 
).  Input-output  stability  of  Ea  has  been  argued  using  analysis  techniques  for  systems  with  sufficiently 
slowly  varying  input  signals.  Input-output  stability  of  1.b  has  been  established  in  an  ad-hoc  manner 
using  physical  insight.  Bounded  behavior  of  the  overall  system  follows  from  a  version  of  the  small-gain 
theorem.  This  analysis  is  not  intended  to  provide  quantitative  conditions  for  bounded  behavior,  but  rather 
to  validate  in  a  general  sense  the  potentially  hazardous  approach  of  designing  controllers  on  the  basis  of 
linearized  models  calculated  about  non-equilibrium  conditions. 


4.  Concluding  Remarks 

The  computational  tools  described  in  Section  2  constitute  preliminary  efforts  to  develop  a 
general-purpose  environment  for  the  analysis  and  design  of  gain  scheduled  missile  autopilots.  A  long¬ 
term  objective  is  a  systematic  procedure  whereby,  starting  with  tabular  aerodynamic  data  for  any  flight 
vehicle,  nonlinear  six  degree-of-freedom  simulations  and  trimming  and  linearization  routines  can  quickly 
and  easily  be  constructed.  At  the  present  time,  such  capabilities  are  restricted  to  a  specific  air-to-air 
missile  model,  but  the  experience  gained  from  this  exercise  will  serve  as  a  springboard  for  the 
development  of  a  completely  general  framework  in  the  future. 

The  stability  analysis  of  Section  3  lends  justification  to  the  potentially  hazardous,  yet  commonly 
practiced  technique  of  designing  linear  autopilots  based  on  linearizations  of  a  missile’s  nonlinear 
aerodynamics  about  non-equilibrium  conditions.  The  analysis  indicates  that  when  a  collection  of  linear 
autopilots  designed  in  this  manner  is  used  to  construct  a  gain  scheduled  autopilot,  bounded  system 
performance  can  be  expected,  thus  confirming  nonlinear  simulation  results  and  actual  behavior  in  the  air. 
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Abstract 

This  report  describes  the  aircraft  tire  properties  measured  with  Wright  State’s  Optotrak 
vision  system.  The  data  of  pre-defined  points  was  detected  in  a  3D  global  coordinate  system 
using  infrared  light  emitted  from  the  markers.  The  optical  device  is  synchronized  with  analog 
(non-optical)  sensors,  i.e.,  force  and  torque  transducers.  In  the  first  experiment,  the  tire  (KC-135 
and  F-16)  was  mounted  on  a  Tire  Force  Machine  (TFM),  where  the  vertical,  lateral  and 
longitudinal  (fore-aft)  forces  were  applied.  The  instrument  recorded  the  tire  deformation,  force, 
moment,  TFM  tolerance  and  table  movement  as  the  table  travels.  The  second  experiment 
involved  the  use  of  an  84”  dynamometer. 

It  was  observed  that  the  TFM  structure  sloppy  induces  an  error  as  high  as  1 00%.  Due  to  the 
tolerance  and  misalignment  of  the  testing  machine,  the  real  tire  deformation  measured  from 
global  coordinates  has  to  subtract  the  TFM  structure  motion.  In  other  words,  for  each  tire,  the 
data  measured  in  global  coordinates  must  be  converted  into  a  body  coordinate  system  in  order  to 
eliminate  the  noise.  The  dynamometer  data  was  contaminated  because  of  vibration  and  light 
deflection  of  the  glass;  therefore  a  retest  will  be  needed. 
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DETERMINATION  OF  3D  DEFORMATIONS,  FORCES  AND  MOMENTS  OF 
AIRCRAFT  TIRES  WITH  A  SYNCHRONIZED  OPTICAL  AND  ANALOG  SYSTEM 

Junghsen  Lieh 

Introduction 

A  CCD  is  an  electric  image  sensor  with  light  sensitive  pixels.  Numerous  researchers  are 
continuously  improving  the  device  and  further  explore  its  commercial  potentials.  A  vision  sensor 
is  capable  of  observing  geometry  and  movement  of  an  object.  The  images  can  be  recorded  with 
passive  or  active  light  sources  (regular,  laser  or  infrared)  reflecting  or  emitting  from  the  object.  A 
computerized  vision  sensor  contains  the  following  advantages: 

(1)  The  light  source  can  be  detected  at  a  remote  distance; 

(2)  The  sensor  has  a  high  spatial  resolution  (for  example  2048  pixels  per  array)  with  adjustable 
fields  of  view; 

(3)  The  measurements  may  be  displayed  in  real  time  with  hundreds  of  frames  per  second; 

(4)  The  system  is  portable  requiring  little  human  labor. 

The  measured  3D  displacements  may  be  converted  into  velocity  and  acceleration  vectors  of 
the  object  movement  or  into  a  local  coordinate  system  for  further  analysis.  This  data  can  be  used 
for  modal  analysis  (such  as  frequencies  and  amplitudes  with  Fast  Fourier  Transform).  It  can  also 
be  utilized  for  visualization  and  control  signals. 

It  was  observed  that  the  TFM  structure  sloppy  could  yield  an  error  in  tire  deformation  by 
nearly  100%.  The  real  tire  deformation  measured  in  global  coordinates  has  to  subtract  the  TFM 
structure  motion.  To  accomplish  this,  the  raw  data  must  be  converted  into  a  body  coordinate 
system  thus  to  eliminate  the  error  induced  by  the  TFM  structure  tolerance  and  misalignment. 

The  tire  data  for  the  84”  dynamometer  was  contaminated  by  the  glass  vibration  and  light 
deflection;  therefore  a  retest  will  be  needed. 
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Methodology 


This  section  describes  the  instrument,  the  procedure  of  measurements,  coordinate 

transformations,  and  results. 

1.  Synchronized  Optical  and  Analog  System 

The  schematic  of  the  measurement  system  is  shown  in  Figure  1.  The  system  includes 

•  A  3D-position  sensor  with  the  depth  of  field  ranging  from  3  to  20  meters.  The  highest 
accuracy  is  0.05-0.1  mm  in  x  andy  directions  and  0.1-0.15  mm  in  z  axis. 

•  An  image  control  with  maximum  marker  and  frame  sampling  rates  of  3,000  and  400  Hz.  The 
maximum  number  of  markers  to  be  detected  is  256. 

•  A  data  acquisition  unit  synchronizes  16  analog  transducers.  A  power  supply  and  a  system 
conditioner  were  located  between  the  analog  sensors  and  the  data  acquisition  unit. 

•  A  PC  with  an  interface  board  and  a  monitor.  The  interface  board  is  connected  to  the  system 
image  control  unit.  Data  is  displayed  in  the  monitor  and  stored  in  the  hard  drive. 

•  Multiple-channel  IRED  strobers  are  connected  to  the  system  image  control  unit.  IRED 
signals  are  collected  by  the  3-lens  sensor  and  processed  by  the  system  image  control  unit. 
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Figure  1.  Schematic  of  the  optical  and  coordinate  system  for  tire  measurement. 


Prior  to  testing,  active  IRED  markers  were  mounted  on  the  tire,  wheel,  machine  frame,  and 
TFM  table.  The  optical  device  was  placed  about  3  meters  from  the  tire  sidewall.  During  the 
lateral  testing,  the  optical  system  was  moved  to  one  end  of  the  TFM  table,  however,  the  system 
was  placed  at  the  side  of  table  for  the  rolling  testing.  The  analog  channels  for  forces,  moments, 
table  motion,  and  contact  pressures  were  connected  to  a  data  acquisition  unit,  which  is 
synchronized  with  the  optical  sensor. 

2.  Data  Processing,  Coordinate  Transformation,  and  Analysis 

The  coordinate  data  represents  3D  time  histories  (i.e.,  time-domain  positions)  of  the 
infrared  markers  mounted  on  the  tire,  wheel,  machine  frame  and  table.  The  analog  voltage 
represents  forces  and  moments,  which  were  amplified  through  calibration  factors.  To  reduce  the 
hard  drive  memory,  all  data  sets  were  stored  in  a  binary  form,  which  was  late  processed  into  an 
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ASCII  format  before  any  further  use.  MATLAB  programs  perform  the  downstream  processing 
tasks  (such  as  computations  and  plotting).  After  the  data  set  has  been  converted,  it  was  examined 
and  trimmed  in  order  to  extract  the  interested  sample  points.  The  final  version  of  the  data  was 
analyzed  and  provided  in  digital  and  graphic  formats. 

To  eliminate  the  tolerance  and  misalignment  in  the  testing  machine  thus  to  obtain  true  tire 
deformations,  all  samples  will  need  corrections  through  two  coordinate  transformations:  (1)  from 
optical  (global)  coordinates  to  intermediate  coordinates  and  (2)  from  the  intermediate  coordinate 
system  to  wheel  coordinates.  The  transformations  involve  the  computation  of  Euler  angles  of  the 
wheel  plane  for  each  time  step  (i.e.,  each  sample).  In  addition  to  coordinate  transformations,  the 
following  tasks  may  be  considered.  (1)  Process  the  data  into  a  readable  ASCII  format  to  allow 
the  tire  data  be  easily  used  for  scientific  and  applied  research.  (2)  Construct  hysteresis  curves.  (3) 
Establish  the  relations  among  3D  loading  (forces,  moments,  contact  pressures),  3D  deformations, 
coefficients  of  adhesion,  and  yaw  angles  in  the  wheel  coordinate  system. 

2.1.  Coordinate  Transformation  for  Noise  Elimination 

The  maximum  TFM  frame  movement  with  respect  to  table  directions  during  the  tire  test 
was  found  as  much  as  follows: 

Longitudinal  motion:  35  mm  (1.4”) 

Lateral  motion:  1 5  mm  (0.6” ) 

Vertical  motion:  5  mm  (0.2”) 

The  typical  time  history  of  lateral  and  longitudinal  wheel  center  movement  during  one  test  is 
shown  in  Figures  2  and  3.  The  TFM  misalignment  and  tolerance  obviously  induces  a  significant 
error  if  they  are  included  as  part  of  the  tire  deformation.  To  eliminate  the  noise,  it  will  be 
necessary  to  normalize  the  tire  data  with  respect  to  wheel  coordinates  through  the  following 
coordinate  transformations: 
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•  from  optical  (global)  coordinates  o-xyz  to  an  intermediate  coordinate  system  o- 
x'y'z',  as  shown  in  Figure  1, 

•  from  the  intermediate  coordinate  system  o  -x  'y  'z '  to  wheel  coordinates  o  -x  "y  "z 

Euler  angles  of  the  wheel  plane  for  each  sample  (i.e.,  each  time  step)  and  each 
measurement  are  needed,  and  they  were  determined  through  the  markers  located  on  the  TFM 
frame.  As  illustrated  in  Figure  1,  the  relative  position  vector  expressed  in  the  o-xyz  (optical  or 
global)  coordinate  system  is 

£a/o-=£a-£o'  (1) 

where  rA  and  r0,  were  measured  by  the  optical  system.  When  expressed  in  o-x'y'z' coordinates, 
the  local  position  vector  becomes 

£'mo<  =£i(rA-r0,)  (2) 

where  /?,  is  the  rotational  matrix  from  o-xyz  to  o-x'y'z'  satisfying  the  transformation  of  unit 
vectors  of  o  '-x'y'z' and  o-xyz  coordinate  systems: 

£o'=£l£o 

The  final  data  will  be  converted  into  the  wheel  coordinate  system  (o  -x"y  "z  "),  i.e., 

-  A/o'  —  —2  —  A/o' 

where  /?,  is  the  rotational  matrix  from  o-x'y'z'  to  o  -x"y"z". 

To  express  the  data  in  minimum  memory  and  processing  time,  most  data  sets  were  trimmed 
and  only  the  interested  points  were  used.  The  intention  of  the  task  is  to  provide  the  data  in  an 
easily  readable  format.  In  addition,  the  relation  among  3D  loading  (forces,  moments  and  contact 
pressures),  true  tire  deformations,  coefficients  of  adhesion,  and  yaw  angles  might  be  established. 
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Further  analysis  will  be  needed  beyond  the  Summer  Extension  Program  because  of  a  large 
number  of  data  sets  (approximately  250  MB).  The  goal  of  this  project  is  to  provide  the  basic 
information  for  better  understanding  of  the  fundamentals  of  aircraft  tire  mechanics  such  that  the 
data  may  be  utilized  for  more  accurate  vehicle  simulation  and  design. 

The  following  sections  are  the  results  of  such  analyses  in  both  untransformed  and 
transformed  coordinate  systems.  The  result  of  KC-135  tire  is  given  first,  and  then  followed  by 
the  F- 16  tire. 

2.2.  KC-135  Tire  on  Tire  Force  Machine  (TFM) 

The  tire  (SN  60150607)  with  a  undeformed  dimension  of  48.6”  (dia.)  x  16.8”  (width)  was 
inflated  to  170  psi.  Two  test  plans  were  performed,  i.e.,  free  rolling  and  lateral  deformation  tests 
(see  Figures  4  and  5).  In  both  test  plans,  the  normal  load  was  set  to  42k  lbs,  and  a  sampling  rate 
was  given.  17  markers  were  used,  with  10  on  the  tire  sidewall,  4  on  the  wheel  and  TFM  frame, 
and  3  on  the  moving  table.  In  addition  to  the  optical  markers,  7  analog  channels  recorded  the  3D 
tire  forces,  moments  and  table  position,  i.e.,  Fx»,  Fy,  Fz»,  Mx»,  My,  Mz«,  and  xT,  respectively. 

For  the  free  rolling  test,  the  tire  plane  was  aligned  with  the  fore-aft  direction  of  the  table 
(see  Figure  4).  Figures  6  and  7  show  the  sidewall  deformation  of  tire  with  4°-yaw  angle  in 
untransformed  and  transformed  coordinate  systems.  It  can  be  observed  that  the  untransformed 
tire  deformation  contains  a  significant  error. 

For  the  lateral  deformation  test  plan  as  shown  in  Figure  5,  the  tire  was  rotated  90°.  In  other 
words,  the  tire  sidewalls  were  in  the  lateral  direction  of  the  table.  The  lateral  force  applied  in  this 
test  plan  started  from  0  to  8k  lbs,  then  the  force  was  reduced  back  to  0  lbs.  The  force  was 
continuously  applied  from  0  to  -8k  lbs  and  finally  returned  to  0  lbs  again.  The  sidewall 
deformation  expressed  in  terms  of  global  coordinates  and  wheel  coordinates  is  shown  in  Figures 
8  and  9,  in  which  an  error  of  over  80%  was  observed  in  the  untransformed  data.  These  figures 
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indicate  that  the  true  sidewall  deformation  is  much  smaller  after  the  structural  misalignment  and 
machine  tolerance  have  been  eliminated. 

2.3.  F-16  Tire  on  Tire  Force  Machine  (TFM) 

An  F-16  Goodyear  baseline  radial  block  50  tire  (27.75x8.7R14.5/24PR  SN  33381603)  was 
inflated  to  300  psi.  Different  test  plans  were  performed.  However,  only  the  vertical  and  lateral- 
relaxation  tests  were  analyzed  due  to  the  limit  in  the  time  and  large  size  of  data  (see  Figure  10). 
Some  results  from  preliminary  analysis  for  the  other  tests  are  also  presented. 

In  the  vertical  test  plan,  5  markers  were  mounted  on  the  sidewall.  The  wheel  was  moved 
downward  until  the  normal  load  reached  20k  lbs,  it  was  then  moved  upward  to  its  original 
position.  Figures  11  and  12  show  the  sidewall  profile  in  the  global  coordinate  and  wheel 
coordinate  systems.  It  is  evident  that  without  coordinate  transformations,  the  sidewall  profile  is 
not  physically  explainable.  The  relation  between  the  normal  loading  and  tire  radial  deflection  is 
shown  in  Figure  13,  in  which  a  nearly  linear  Hooke’s  law  is  observed,  although  a  hysteresis 
phenomenon  exists. 

In  the  lateral-relaxation  test,  the  normal  load  was  set  to  20k  lbs.  There  were  18  markers 
used,  including  5  on  the  tire  sidewall,  4  on  the  wheel  and  TFM  frame,  3  on  the  moving  table,  and 
5  on  the  tire  periphery  (see  Figure  10).  The  table  traveled  back  and  forth,  with  lateral  force 
starting  from  0  to  -6k  lbs,  then  returned  to  0  lbs.  The  force  was  continuously  applied  from  0  to 
6k  lbs  and  finally  returned  0  lbs  again.  The  untransformed  and  transformed  sidewall 
deformations  are  shown  in  Figures  14  and  15.  Since  the  untransformed  deformation  includes  the 
TFM  frame  movement,  therefore,  an  error  of  over  100%  is  observed.  As  the  lateral  force 
exceeded  certain  value,  the  tire  near  the  wheel  interface  will  buckle  (see  Figure  15).  The  tire 
periphery  bending  (relaxation)  is  shown  in  Figures  16  and  17,  again,  a  significant  error  exists  if 
the  untransformed  data  is  used. 
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The  preliminary  results  of  other  testing  are  illustrated  in  Figures  18  and  19,  in  which  Figure 
18  is  tire  sidewall  deformation  for  the  free  rolling  test,  and  Figure  19  is  for  the  tire  contact 
pressure.  Further  research  work  will  be  needed  to  complete  the  data  processing  and  analysis  in 
the  future,  depending  on  the  funding  availability. 

2.4.  F-16  Tire  on  84”  Dynamometer 

For  safety  consideration,  a  glass  was  placed  between  the  optical  system  and  the 
dynamometer.  When  machine  was  operated,  it  was  found  that  the  glass  vibrated  with  the 
dynamometer.  The  light  going  through  the  glass  has  been  contaminated.  It  is  concluded  that  the 
data  can  no  longer  be  used,  and  further  retest  without  glass  is  recommended. 

3.  Summary  and  Conclusions 

The  use  of  a  synchronized  optical  and  analog  device  allows  3D  images  and  analog  data  be 
collected  concurrently.  The  relation  between  the  tire  deformation  and  force/moment  under 
various  operating  conditions  is  critical  to  the  vehicle  dynamics  simulation  and  control  synthesis. 
Improper  tire  data  can  result  in  significant  errors  in  the  vehicle  design,  which  in  turn  may  lead  to 
safety  problems.  The  testing  machine  itself  (particularly  the  TFM)  was  found  to  move  with  large 
displacements  along  the  table  longitudinal  and  lateral  directions  (i.e.,  35  mm  and  15  mm, 
respectively).  The  TFM  frame  sloppy  itself  could  have  been  greater  than  the  true  tire 
deformation,  especially  for  small  tires  (such  as  F-16  tires).  Therefore,  any  tolerance  or 
misalignment  in  the  testing  machine  should  be  eliminated  in  order  to  obtain  true  tire  properties. 


It  may  be  possible  to  redesign  the  TFM  mechanism  or  at  least  replace  some  wom-out  parts 
in  order  to  minimize  the  sloppy.  However,  this  will  certainly  increase  the  cost.  In  addition,  no 
matter  how  accurate  the  new  mechanism  will  be,  it  still  expects  some  tolerance  in  the  machine. 
To  completely  eliminate  the  tire  measurement  noise,  the  simplest  yet  cheapest  approach  will  be 
to  adopt  the  method  used  in  this  project 
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The  data  sets  collected  during  the  summer  of  1996  include  the  machine  tolerances  and 
misalignments.  After  coordinate  transformations,  these  error  can  be  eliminated.  As  is  expected, 
the  research  work  can  lead  to  tremendous  improvements  in  the  development  of  accurate  and 
fundamental  tire  characteristics.  Besides,  the  data  may  be  utilized  for  verification  of  existing 
finite  element  simulation  codes.  The  establishment  of  accurate  relationships  among  tire 
deformations,  forces,  moments,  contact  pressures,  yaw  angles,  and  coefficients  of  adhesion  will 
benefit  the  Air  Force  and  aircraft  industry. 
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Figure  3.  TFM  frame  longitudinal  movement  during  test. 
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17  markers  plus 
7  analog  channels  for  forces, 
moments  and  table  position 


\  table  motion 
\  (fore-aft) 


Figure  4.  KC-135  tire  free  rolling  testing. 


Figure  5.  KC-135  tire  lateral  deformation  test. 


Figure  6.  KC-135  tire  untransformed  sidewall  profile  under  free  rolling  test. 


Transformed  Sidewall  Position(Wheel  Center  as  Reference),  mm 
Figure  7.  KC-135  tire  transformed  sidewall  profile  under  free  rolling  test. 
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Untransformed  Sidewall  Position,  mm 


Figure  9.  KC-135  tire  transformed  sidewall  deformation  under  lateral-relaxation  test, 


Figure  10.  Schematic  of  F-16  tire  under  vertical  and  lateral-relaxation  tests. 
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Figure  13.  The  relation  between  normal  loading  and  vertical  tire  deflection  for  F-16  tire. 
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Figure  16.  F-16  tire  untra 
test. 
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Figure  17.  F-16  tire  trans 
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Figure  18.  F-16  tire  untransformed  sidewall  deformation  under  4°-yaw  free  rolling  test  (normal 
load=15kip). 


Figure  19.  F-16  tire  contact  pressure  under  4°-yaw  free  rolling  test  (normal  load  =  15kip). 
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Control  of  Linear  Systems  with  Rate  Limited  Actuators 
and  Its  Applications  to  Flight  Control  Systems 

Zongli  Lin 

Department  of  Applied  Mathematics  and  Statistics 
State  University  of  New  York  at  Stony  Brook 


Abstract 

A  design  technique  is  proposed  for  obtaining  feedback  laws  that  cause  the  output  of  a  linear 
system  subject  to  actuator  rate  saturation  to  track  a  desired  command  signal.  This  design  tech¬ 
nique  combines  two  design  techniques  recently  developed  for  linear  systems  with  position  limited 
actuators,  piecewise-linear  LQ  control  ([20])  and  low-and-high  gain  feedback  ([18]),  and  hence 
takes  advantage  of  both  design  techniques,  while  avoiding  their  disadvantages.  In  the  case  that 
only  the  output  is  available  for  feedback,  the  performance  of  the  state  feedback  law  is  preserved 
by  the  use  of  a  fast  observer.  The  proposed  design  technique  is  also  implemented  in  the  Matlab 
environment.  An  open-loop  exponentially  unstable  fighter  aircraft  is  used  to  demonstrate  the 
effectiveness  of  the  proposed  control  design  method. 
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Control  of  Linear  Systems  with  Rate  Limited  Actuators 
and  Its  Applications  to  Flight  Control  Systems 

Zongli  Lin 


1  Introduction 

Every  physical  actuator  is  subject  to  constraints.  These  constraints  include  both  position  and 
rate  saturation.  In  the  past  few  years  there  has  been  much  interest  concerning  the  stabilization  of 
linear  systems  with  position  saturating  actuators,  resulting  in  several  promising  design  techniques. 
In  this  work,  we  will  recourse  to  the  low-and-high  gain  (LHG)  design  technique  [18]  and  the 
piecewise  linear  LQ  control  (PLC)  design  technique  [20].  Design  techniques  that  are  conceptually 
similar  to  the  PLC  design,  the  high  performance  bounded  control  (HPB),  the  scheduled  low  gain 
feedback  and  scheduled  low-and-high  gain  feedback  were  recently  developed  in  [15],  [14]  and  [7], 
respectively.  Additional  related  work  might  be  found  in  the  above  mentioned  papers  and  in  [2]. 

While  actuator  position  saturation  has  been  addressed  in  the  recent  literature,  few  design 
techniques  are  currently  available  to  deal  with  actuator  rate  saturation,  which  often  presents  a 
more  serious  challenge  to  control  engineers,  especially  flight  control  engineers.  It  is  known  ([1]) 
that  actuator  rate  saturation  could  induce  a  considerable  phase-lag,  which  when  associated  with 
rate  saturation  has  a  destabilizing  effect.  For  example,  investigators  have  identified  rate  saturation 
as  a  contributing  factor  to  the  recent  mishaps  of  the  YF-22  [3]  and  the  Gripen  [5]  prototypes  and 
the  first  production  Gripen  [17].  For  further  discussion  on  the  destabilizing  effect  of  actuator  rate 
saturation,  see  [1]. 

As  a  result,  while  visiting  Wright  Laboratory  as  a  1996  AFOSR  summer  faculty  research 
associate,  the  PI  and  his  lab.  focus  point  at  Wright  Laboratory,  Dr.  Siva  Banda  have  chosen  the 
following  topic  to  work  on:  control  of  linear  systems  with  position  and/or  rate  limited  actuators. 
The  Pi’s  participation  in  the  AFOSR  Summer  Faculty  Research  Program  has  been  very  fruitful. 
In  what  follows,  we  briefly  summarize  the  results  obtained  during  this  Summer  Faculty  Research 
Program  and  the  subsequent  Summer  Research  Extension  Program  as  follows. 

°Wright  Fellow,  Chief  of  WL/FIGC  (Flight  Control  Division,  Flight  Dynamics  Directorate),  Wright  Laboratory, 
Wright-Patterson  Air  Force  Base,  Dayton,  Ohio. 
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1.  A  composite  nonlinear  feedback  design  based  on  a  nominal  linear  feedback  is  proposed  for 
(exponentially  unstable)  second  order  linear  systems  with  position  limited  actuators.  The 
design  yields  nonlinear  state  feedback  laws  that  both  increase  the  speed  of  closed-loop  sys¬ 
tem  response  to  the  command  input  and  reduce  the  overshoot,  while  not  imperiling  the 
performance  achieved  by  the  nominal  linear  feedback  controller  in  the  face  of  actuator  posi¬ 
tion  saturation.  As  a  result,  arbitrarily  fast  step  response  is  achieved  with  arbitrarily  small 
overshoot.  A  flight  control  system  is  used  to  demonstrate  the  applicability  and  effectiveness 
of  the  design. 

Further  details  on  this  result  can  be  found  in  [8]. 

2.  A  robust  stabilizing  state  feedback  design  technique  for  (exponentially  unstable)  linear  sys¬ 
tems  with  rate  limited  actuators  is  proposed.  The  design  combines  two  design  techniques 
recently  developed  for  linear  systems  with  position  limited  actuators,  piecewise-linear  LQ 
control  [20]  and  low-and-high  gain  feedback  [18].  An  F-16  class  open-loop  unstable  fighter 
aircraft  model  is  used  to  demonstrate  the  applicability  and  effectiveness  of  the  proposed 
design  method.  The  combined  design  takes  advantages  of  both  design  techniques,  while 
avoiding  their  disadvantages.  As  a  result,  the  resulting  nonlinear  feedback  law  achieves  fast 
transient  response  and  disturbance  rejection  in  the  face  of  actuator  uncertainties.  Moreover, 
the  design  also  guarantees  a  fixed  domain  of  attraction,  which  does  not  shrink  as  the  values 
of  the  design  parameters  are  increased  to  achieve  higher  degree  of  disturbance  rejection. 

Further  details  on  this  result  can  be  found  in  [9]. 

3.  It  is  shown  that,  if  a  linear  system  is  asymptotically  null  controllable  with  bounded  con¬ 
trols,  then,  when  subject  to  both  actuator  position  and  rate  saturation,  it  is  semi-globally 
stabilizable  by  linear  state  feedback.  If,  in  addition,  the  system  is  also  detectable,  then  it  is 
semi-global  stabilizable  via  linear  output  feedback. 

Further  details  on  this  result  can  be  found  in  [6]. 

4.  The  design  technique  proposed  in  [9]  (as  described  in  Item  2)  for  stabilization  of  a  linear 
system  with  rate-limited  actuators  is  utilized  to  design  feedback  laws  that  cause  the  system 
output  to  track  a  desired  command  signal.  The  tracking  control  law  is  robust  in  the  sense 
that  it  rejects  the  disturbance  associated  with  the  actuators  in  the  face  of  actuator  uncer- 
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tainties.  An  open-loop  exponentially  unstable  fighter  aircraft  is  used  to  demonstrate  the 
effectiveness  of  the  proposed  control  design  method. 

A  paper  that  describes  the  state  feedback  result  has  been  presented  at  1997  AIAA  GNC 
Conference  [10].  A  complete  version  of  the  paper  that  includes  both  state  feedback  and 
output  feedback  results  has  been  written  [11]  and  will  be  included  in  this  final  report. 

5.  Software  in  MATLAB  is  developed  for  the  implementation  and  simulation  of  the  results 
obtained. 

Due  to  space  limitation,  in  this  final  report,  we  only  include  detailed  results  that  have  not 
been  published,  i.e.,  [11].  More  specifically,  in  [9]  we  proposed  a  method  of  designing  stabilizing 
feedback  control  laws  for  linear  systems  taking  into  account  the  effect  of  actuator  rate  saturation 
([9]).  The  proposed  feedback  design,  referred  to  as  the  combined  PLC/LHG  state  feedback  design 
method,  combines  the  two  recently  developed  design  techniques,  the  PLC  and  the  LHG  design 
techniques,  both  originally  proposed  for  linear  systems  with  actuator  position  saturation.  It 
inherits  the  advantages  of  both  design  techniques,  while  avoiding  their  disadvantages.  Thus,  exact 
knowledge  of  the  dynamics  of  the  actuators  will  not  be  needed  and  the  actuator  disturbances  can 
be  rejected.  In  particular,  in  the  LHG  design  [18],  a  low  gain  feedback  law  is  first  designed  in  such 
a  way  that  the  actuator  does  not  saturate  in  position  and  the  the  closed-loop  system  remains 
linear.  The  gain  is  chosen  low  to  enlarge  the  region  in  which  the  closed-loop  system  remains 
linear  and  hence  to  enlarge  the  domain  of  attraction  of  the  closed-loop  system.  Then,  utilizing  an 
appropriate  Lyapunov  function  for  the  closed-loop  system  under  this  low  gain  feedback  control 
law,  a  linear  high  gain  feedback  control  law  is  constructed  and  added  to  the  low  gain  feedback 
control  to  form  the  final  LHG  feedback  control  law.  Such  a  linear  low-and-high  gain  feedback 
control  law  speeds  up  the  transient  response  for  the  state  in  a  certain  subspace  of  the  state  space 
and  is  able  to  stabilize  the  system  in  the  presence  of  input-additive  plant  uncertainties  and  to 
reject  arbitrarily  large  bounded  input-additive  disturbances.  The  disadvantage  of  this  control 
law  is  that  the  transient  response  for  the  state  outside  that  subspace  of  the  state  space  remains 
that  of  the  low  gain  feedback,  which  is  typically  sluggish  (due  to  low  feedback  gain  for  a  large 
domain  of  attraction).  On  the  other  hand,  the  aim  of  the  PLC  [20]  (HPB  control  [15])  or  the 
scheduled  low  gain  feedback  design  [14]  schemes  is  to  increase  the  feedback  gain  either  piecewisely 
or  continuously  while  adhering  to  the  input  bound  as  the  trajectories  converge  toward  the  origin. 
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Such  a  design  results  in  fast  transient  speed  for  all  states.  However,  it  lacks  robustness  to  large 
uncertainties  and  the  ability  to  reject  disturbances. 

Our  objective  is  to  utilize  the  combined  PLC/LHG  state  feedback  design  technique  developed 
in  [9]  for  stabilization  to  design  state  feedback  laws  that  achieve  robust  tracking  performance. 
The  signals  to  be  tracked  are  modeled  as  the  state  of  a  reference  system  and  the  tracking  control 
problem  is  formulated  as  an  output  regulation  problem  ([4],  [13]).  In  the  case  that  only  the  output 
is  available  for  feedback,  the  performance  of  the  state  feedback  laws  is  preserved  by  the  use  of  a 
fast  observer.  We  note  here  that  the  PLC  design  method  as  originally  proposed  in  [20]  is  only  for 
state  feedback.  Although  the  LHG  output  feedback  design  is  given  in  [18],  its  design  and  analysis 
do  not  carry  through  in  the  combined  PLC/LHG  design,  since  the  closed- loop  system  under  the 
combined  PLC/LHG  state  feedback  law  is  discontinuous.  Indeed,  discontinuity  in  the  PLC /LHG 
state  feedback  laws  is  the  primary  source  of  difficulty  in  its  implementation  with  observer  state. 

The  remainder  of  this  report  is  organized  as  follows.  In  Section  2,  we  precisely  formulate  the 
two  problems,  the  state  feedback  robust  tracking  problem  and  the  output  feedback  robust  tracking 
problem.  Section  3  recalls  from  [9]  the  combined  PLC/LHG  state  feedback  design  algorithm  that 
was  proposed  for  constructing  stabilizing  state  feedback  laws.  This  design  algorithm  is  used  in 
Section  4  to  construct  feedback  laws  that  solve  the  state  feedback  robust  tracking  control  problem 
as  formulated  in  Section  2.  Section  5  deals  with  the  case  that  only  output  is  available  for  feedback. 
In  this  case,  a  fast  observer  is  used  to  implement  the  state  feedback  laws  constructed  in  Section 
4  and  shown  to  preserve  performance  of  the  state  feedback  laws.  In  Section  6,  a  fighter  aircraft 
model  is  used  to  demonstrate  the  effectiveness  of  the  proposed  design  algorithm.  Concluding 
remarks  are  made  in  Section  7. 

2  Problem  Formulation 

Consider  the  linear  dynamical  system 
'  x  =  Ax  +  Bv,  ®(0)  eTcR" 

«  =  sat A(-Tv  +  Tu  +  d),  t)(0)eVclRm 

<  w  =  Sw,  w(0)  €  W  C  IR5  (1) 

e  =  Cx  +  Qw 
,y  =  Ex 
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where  the  first  equation  describes  the  plant,  with  state  x  £  IR.n,  input  u  £  IRm  and  output 
y  £  IR9,  the  second  equation  represents  actuator  dynamics  with  state  v  £  Hm  and  is  subject 
to  disturbance  d  £  Hm  and  rate  saturation  satA('))  the  third  equation  defines  the  reference 
signal  —Qw  £  IRP  to  be  tracked  by  the  plant  output  Cx  £  1RP.  For  A  =  (Ai,  A2,  •  •  • ,  Am), 
Aj  >  0,  the  actuator  rate  saturation  function  sat  a  :  IRm  — »  !Rm  is  the  standard  saturation 
function,  i.e.,  satA(v)  =  [satA1(vi),satA2(w2),  •  •  •  ,satAm(vm)],  satA^Ui)  =  sign(uj)  min{Aj,  |vj|}, 
the  positive  definite  diagonal  matrices  T  =  diag(ri,  72,  •  ■  • ,  rm)  represents  the  “time  constants”  of 
the  actuators.  Finally,  all  three  sets  X,  V  and  W  are  bounded. 

We  also  make  the  following  assumptions  on  the  system. 

Assumption  2.1  The  pair  ( A,B )  is  stahilizable; 

Assumption  2.2  The  eigenvalues  of  S  have  nonnegative  real  parts. 

Assumption  2.3  The  disturbance  is  uniformly  bounded  by  a  known  (arbitrarily  large)  constant 
D,  i.e.,  ||d(f)||  <  D,  Vf  >  0. 

Remark  2.4  We  note  that  Assumption  2.1  is  obviously  necessary,  Assumption  2.2  does  not  in¬ 
volve  any  loss  of  generality  since  asymptotically  stable  modes  of  the  reference  model  do  not  affect 
the  tracking  performance  of  the  plant,  and  Assumption  2.3  is  satisfied  by  essentially  all  practical 
disturbances. 

Before  stating  the  problem,  we  make  the  following  preliminary  definition. 

Definition  2.5  The  data  ( D ,  2oo>  £0 o)  is  said  to  be  admissible  for  state  feedback  [output  feedback] 
if  D  is  a  nonnegative  real  number,  and  Z ^  and  £00  are  respectively  a  subset  of  IRn+m  []R2n+m] 
and  1RP  both  containing  the  origin  as  an  interior  point. 

The  objective  is  to  provide  a  design  algorithm  for  solving  the  following  problems. 

Problem  1  [State  feedback  robust  tracking  problem.]  Given  the  data  (D,Z oo,£oo)j  admissible 
for  state  feedback,  find,  if  possible,  a  state  feedback  law  u  —  F(x,v,w),  such  that  the  closed-loop 
system  satisfies, 
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1.  Every  trajectory  of  the  system 

f  x  —  Ax  +  Bv 


(2) 


l  v  =  satA(-Tv  +  TF{x,v,0 )  +  d) 

starting  from  X  x  V  enters  Z0 0  in  a  finite  time  and  remains  in  it  thereafter. 

2.  For  all  x(0)  G  X,  v(0)  G  V  and  u;(0)  G  W,  the  solution  of  the  closed-loop  system  is  such 
that  e(t)  enters  the  set  £0 o  in  a  finite  time  and  remains  in  it  thereafter. 

Problem  2  [Output  feedback  robust  tracking  problem.]  Given  the  data  admissible 

for  output  feedback,  find,  if  possible,  an  output  feedback  law 
(ip~  ip(0)  €  X  C  IRn 

[u  =  F(ip,v,w) 

such  that  the  closed-loop  system  satisfies 

1.  Every  trajectory  of  the  system 

{x  =  Ax  +  Bv 

v  =  sat a(-Tv  +  TF(ip,  v,  0)  +  d) 
ip  =  Ex) 

starting  from  X  x  V  x  X  enters  Z^  in  a  finite  time  and  remains  in  it  thereafter. 

2.  For  all  x(0),ip(0)  G  X,  v(0)  G  V  and  w(0)  G  W,  the  solution  of  the  closed-loop  system  is 
such  that  e{t)  enters  the  set  £0 0  in  a  finite  time  and  remains  in  it  thereafter. 

3  A  Combined  PLC/LHG  Design  Algorithm 

In  this  section,  we  recall  from  [9]  a  design  algorithm  that  is  a  combination  of  the  piecewise  linear 
LQ  control  [20]  and  the  low-and-high  gain  feedback  [18].  For  completeness,  we  organize  this 
section  as  follows.  Subsections  3.1  and  3.2  respectively  recapitulate  the  PLC  and  the  LHG  design 
techniques.  Subsection  3.3  presents  the  combined  PLC/LHG  design  algorithm. 

3.1  Piecewise  Linear  LQ  Control  Design 

Consider  the  linear  dynamical  system  subject  to  actuator  position  saturation, 

x  =  Ax  +  PsatA(u),  x(0)  G  X  C  lRn,  u  G  IRm  (3) 
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where  the  saturation  function  sat^  :  ]Rm  ->  ]Rm  is  as  defined  in  Section  2,  the  pair  (A,  B)  is 
assumed  to  be  stabilizable,  and  X  is  bounded. 

The  PLC  design  is  based  on  the  following  LQ  algebraic  Riccati  equation, 

A'P  +  PA-  PBR-'B'P  +  I  =  0  (4) 


where  R  =  diag(e)  =  diag(ei,  €2, •  •  • , em),  e,  >  0,  are  the  design  parameters  to  be  chosen  later. 

Key  to  the  PLC  scheme  is  the  notion  of  invariant  sets.  A  nonempty  subset  of  e  in  ]Rn  is 
positively  invariant  if  for  a  dynamical  system  and  for  any  initial  condition  z(0)  G  e,  x(t)  G  e  for 
all  t  >  0.  For  the  closed-loop  system  comprising  system  (3)  and  the  LQ  control  u  =  —R~1B'Px, 
simple  Lyapunov  analysis  shows  that  the  Lyapunov  level  set 

e(P,  p)  =  {x  :  x'Px  <  p},  Vp  >  0 


is  an  invariant  set,  provided  that  saturation  does  not  occur  for  all  x  G  e(P,  p).  To  avoid  the 
saturation  from  occurring,  while  fully  utilizing  the  available  control  capacity,  for  a  given  p,  e  = 
(ci,  €2,  •  •  • ,  em)  will  be  chosen  to  be  the  smallest  such  that 


M  = 


c. 


B[Px 


<  Ai, 


\fx  G  e(P,  p) 


where  Bi  is  the  zth  column  of  matrix  B  and  Uj  is  the  zth  element  of  u.  The  existence  and 
uniqueness  of  such  an  e  are  established,  and  an  algorithm  for  computing  such  an  e  is  also  given, 
in  [20].  More  specifically,  it  is  shown  through  the  existence  of  a  unique  fixed  point  that  the 
following  iteration  converges  from  any  initial  value  to  the  desired  value  of  e, 


€n+l  = 


(5) 


where 


$(e)  =  [0i  (e).  02(e),-- 
and  for  each  i  =  1  to  rn, 

The  aim  of  the  PLC  scheme  is  to  increase  the  state  feedback  gain  piecewisely  while  adhering  to 
actuator  bounds  as  the  trajectories  converge  towards  the  origin.  This  is  achieved  by  constructing 
nested  level  sets,  e0.  e\ ,  ■  •  ■ ,  en,  in  such  a  way  that  the  trajectories  traverse  successively  the  surface 
of  each  ex  and  the  control  law  is  switched  to  higher  and  higher  gains  as  each  surface  is  crossed. 
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The  procedure  for  designing  a  PLC  law  is  as  follows.  Given  the  set  of  initial  conditions 
X  C  H",  choose  an  initial  level  set  £0  as, 

£0  =  inf{e(P,p)  :  *  Ce(P,p)} 

We  denote  the  value  of  p  associated  with  £q  as  po,  and  the  corresponding  values  of  e,  R  and  P  as 
eo,  R0  and  P0  respectively.  A  simple  approach  to  determining  e0  and  po  can  also  be  found  in  [20]. 
More  specifically,  it  is  shown  that  the  size  of  £o  grows  monotonically  as  the  parameter  p  grows. 
Hence,  £0  and  po  can  be  determined  by  a  simple  iteration  procedure.  Here  we  would  like  to  note 
that,  as  explained  in  [20],  increasing  p  indefinitely  for  exponentially  unstable  A  will  not  result  in 
an  £0  that  grows  without  bound. 

To  determine  the  inner  level  sets  £j’s,  choose  successively  smaller  pi  where  pi+ 1  <  pj  for  each 
*  ~  • 1  •  i  N.  A  simple  choice  of  such  p^ s  is  the  geometric  sequence  of  the  form 

Pi  =  p0(Ap)\  i  =  0,1,2,-  ..,N 

where  the  p-reduction’  factor  A p  £  (0,1).  (Consequently,  the  values  of  e,  R  and  P  associated 
with  each  of  these  pt  s  are  denoted  as  a,  R{  and  Pi  respectively.)  For  a  discussion  on  the  choice 
of  N  and  A p,  see  [20], 

As  shown  in  [20],  a  critical  property  of  such  a  sequence  of  level  sets  £,•  is  that  they  are  strictly 
nested  in  the  sense  that  £t+i  is  strictly  inside  et  for  each  i  =  0  to  N  —  1.  Here  and  throughout 

this  report,  by  saying  set  S2  is  strictly  inside  set  Si  we  mean  that  S2  C  Si  and  their  boundaries 
do  not  overlap. 

3.2  Low-and-High  Gain  Feedback  Design 

Consider  the  linear  dynamical  system  subject  to  actuator  position  saturation,  input  additive 
disturbances  and  uncertainties, 

x  =  Ax  +  Bsat±(u  +  f{x)  +  d),  z(0)  £  A  C  IRn, u  6  ]Rm  (7) 

where  the  saturation  function  satA  :  lRm  IRm  is  as  defined  in  Section  2,  the  locally  Lipschitz 
function  /  :  IRn  — >  IRm  represents  the  input  additive  plant  uncertainties  and  d  the  input-additive 
disturbance.  The  LHG  feedback  design  for  this  system  is  given  as  follows.  First,  the  level  set 
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So  is  determined  as  in  the  PLC  design.  Correspondingly,  a  state  feedback  law  with  possibly  low 
feedback  gain  is  determined  as, 

ui  =  -Rq'B'Pqx 

A  high  gain  state  feedback  is  then  constructed  as, 
uh  —  -kR^B'Pox,  k  >  0 

The  final  low-and-high  gain  state  feedback  is  then  given  by  a  simple  addition  of  the  low  and  high 
gain  feedbacks  ul  and  uh,  viz., 

u  =  —(1  +  k)R^1B,P0x,  k  >  0 

Here  the  design  parameter  k  is  referred  to  as  the  high  gain  parameter.  As  demonstrated  in 
[18]  the  freedom  in  choosing  the  value  of  this  high  gain  parameter  can  be  utilized  to  achieve 
robust  stabilization  in  the  presence  of  input  additive  plant  uncertainties  /  (x)  and  input-additive 
disturbance  rejection.  Moreover,  the  transient  speed  for  the  states  not  in  the  range  space  of  B'Pq 
will  increase  as  the  value  of  k  increases.  To  see  this,  let  us  consider  the  following  Lyapunov 
function, 

Vo(z)  =  x'Pqx 

The  evaluation  of  V  along  the  trajectories  of  the  closed-loop  system  in  the  absence  of  uncertainties 
and  disturbances  gives, 

V  =  -x'x  -  x'P0BRq1B'P0x  +  2x'P0B[satA(-(k  +  1)Rq1B'P0x)  +  R^B'Pqx] 

m 

=  -x'x  -  x'PqBRq 1B'Pqx  -  2  ^Uj[satA,-((fc  +  1)^0  —  «*] 

i=i 

where  we  have  denoted  the  ith  element  of  v  =  —R^B'Pox  as  V{.  By  the  choice  of  Pq,  it  is  clear 
that  \vi\  <  A,  and  hence  — v* [satA* ((^  +  l)vj)  -  «i]  <  0,  for  each  i  =  1  to  m.  For  any  x  not  in 
the  range  space  of  B'Pq,  that  is  B'Pqx  ^  0,  then,  for  any  i  such  that  vt  /  0,  -Vi[s&tAi({k2  + 
l)uj)  -  Ui]  <  -Ui[satAi((fci  +  l)uj)  -  Uj]  if  k2  >  ki.  However,  for  any  x  in  the  null  space  of  B'Pq, 
— Ui[sat Ai((k  +  l)ui)  -  Ui]  =  0  for  any  value  of  k. 
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3.3  Combined  PLC/LHG  Feedback  Design 

In  this  subsection,  we  present  the  proposed  combined  PLC/LHG  state  feedback  design  for  linear 
systems  subject  to  actuator  rate  saturation  (1). 

The  feedback  control  law  design  is  carried  out  in  the  following  three  steps. 

Step  1.  Choose  a  pre- feedback 


U  —  V  +  u 

Let  x  =  [x  ,v']'.  Then  the  system  (1)  under  the  above  pre-feedback  is  given  by, 
i  =  Ax  +  Bs&tA{Tu  +  d),  i(0)  G^xVc  lRn+m 

where 


(8) 


(9) 


(10) 

Assumption  2.1,  i.e.,  the  pair  (A,  B)  is  stabilizable,  implies  that  (A,  B)  is  stabilizable. 

Step  2.  Apply  the  PLC  state  feedback  design  algorithm  to  system  (9),  and  obtain  a  sequence  of 
nested  level  sets  £0,  £i,  •  ■  ■ ,  £n  (and  correspondingly,  the  parameters  e„,  ei,  •  •  • ,  eN)  and  a  piecewise 
linear  state  feedback  law, 

'  =  -T~lR~lB'PiX  for  x  e  £i\  £i+1,  t  =  0, 1,  •  •  • ,  N  -  1 


’ A 

B~ 

"O' 

A  = 

.0 

0. 

,  B  = 

J. 

u  = 


.UN  =  -T  lRNlB'PNx  for  x  E  eN 


(11) 


Step  3.  Design  the  LHG  state  feedback  based  on  the  PLC  feedback  law  (11)  and  obtain  the 
following  combined  final  PLC/LHG  feedback  law, 

f  ui  =  v  —  (k  +  l)T~1R~1B'Pix  for  i  €  £i\£i+i,  i  =  0, 1,  •  •  • ,  AT-1 
l  un  =  v  —  (k  +  1)T  1Rn1B'Pnx  for  x  €  £n 
where  k  >  0  is  a  design  parameter  to  be  specified  later. 
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4  Robust  Tracking  via  State  Feedback 

In  this  section,  we  will  show  that  the  combined  PLC/LHG  design  technique  as  recalled  in  the 
previous  section  can  be  used  to  construct  feedback  laws  that  solve  the  state  feedback  robust 
tracking  problem  (i.e.,  Problem  1)  as  formulated  in  Section  2.  The  state  feedback  results  are 
presented  in  a  theorem  as  follows. 
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Theorem  4.1  Let  Assumptions  2. 1-2.3  hold,  and  given  the  data  (D,  W oo>£oo)  admissible  for  state 
feedback,  then  Problem  1  is  solvable  if  there  exist  matrices  II  and  F  such  that 


1.  they  satisfy  the  linear  matrix  equations, 

(US  =  AU  +  BT 

(13) 

l  CU  +  Q  =  0 

where  A  and  B  are  as  given  by  (10),  and  C  =  [C  0pXm]- 

2.  there  exists  a  6  =  [<Ji,  62,  •  •  • ,  <5m],  >  0,  such  that  |rjio(t)|  <  A*  —  <5*  for  all  w( 0)  G  W  and 

all  t  >  0,  where  Tj  is  the  ith  row  ofT. 

Moreover,  the  feedback  laws  that  solve  Problem  1  can  be  explicitly  constructed  using  the  combined 
PLC/LHG  design  technique  as  recalled  in  the  previous  section. 


Remark  4.2  It  follows  from  [4]  that  Condition  1  is  necessary  for  solving  Problem  1,  while,  similar 
arguments  as  in  [13]  show  that,  under  some  extra  mild  conditions  on  the  plant  and  the  reference 
model,  Condition  2  is  also  necessary. 


Proof.  We  will  prove  this  theorem  by  explicitly  constructing  a  family  of  combined  PLC/LHG 
state  feedback  laws,  parameterized  in  a  parameter  k  and  showing  that  there  exists  a  k*  >  0  such 
that  for  any  k  >  k*,  the  feedback  law  solves  Problem  1. 

Construction  of  Parameterized  State  Feedback  Laws 

Step  1.  Carry  out  Step  1  of  the  combined  PLC/LHG  feedback  design  of  Section  3.3  as  follows. 
Choose  a  pre-feedback 

u  =  v  +  u  (14) 

Let  x  =  [x' ,v')' ,  Then  system  (1)  under  the  above  pre-feedback  becomes, 

x  =  Ax  +  Bsat/^(Tu  +  d)  (15) 

where  A  and  B  are  given  by  (10).  Also  let  i(0)  G  E  =  (A  x  V)  U  {x  —  niu  :  x  G  X  x  V,w  G  W). 

Step  2.  Carry  out  Step  2  of  the  combined  PLC/LHG  feedback  design  of  Section  3.3  using  5, 
instead  of  Aj  and  E  instead  of  X  x  V,  and  obtain  a  sequence  of  nested  level  sets  Eq,E\,  ■  ■  ■  ,en 
(and  correspondingly,  the  parameters  eo,  ei,  •  •  • ,  e;v)  and  a  piecewise  linear  feedback  law, 

(  Ui  —  -T~lR~lB'Plx  for  x  G  e*  \e»+i,  i  =  °,  1,  •  ■  • ,  N  -  1 
u  —  <  _  _  (16) 

[  un  =  —  T~1Rn1B'Pnx  for  x  G  en 
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Step  3.  Design  the  final  combined  PLC/LHG  tracking  control  laws  as, 

'  Ui  =  v  -  (k  +  l)T~1R~lB'Pix  +  T-1[(/c  +  +  T]w, 

for  x  €  ej\e,+i,  i  =  0, 1,  •  •  •  ,N—1 

(17) 

■  Un  ~  v  ~  &  Pn%  +T  :[(A;  +  1)B’ RNl P^Tl  +  T]w  for  x  G  £/v 

where  Ac  >  0  is  a  design  parameter  to  be  specified  later. 

We  now  proceed  to  show  that, 

Point  1.  there  exists  a  k{  >  0  such  that,  for  all  k  >  k{,  Item  1  of  Problem  1  holds; 

Point  2.  there  exists  a  k*2  >  0  such  that,  for  all  k  >  k*2,  Item  2  of  Problem  1  holds, 

from  which  the  result  of  Theorem  4.1  follows  with  k*  =  ma x{k*,k2}. 

The  existence  of  such  a  k{  follows  readily  from  [9].  It  remains  to  show  the  existence  of  k2  >  0. 
To  do  so,  let  us  introduce  an  invertible,  triangular  coordinate  change  f  =  x-Uw.  Using  Condition 
1  of  the  theorem,  we  have, 

i  —  +  .B[satA(— Tv  +  Tu  +  d)  —  Tio]  ^g^ 

where  u  is  given  by  (17).  Let  E,*,  C  Hn+m  be  such  that  £  6  E^  implies  that  C£  €  Too-  We  next 
show  that  there  exists  a  k*2  >  0  such  that  for  any  k  >  k*2  the  solution  of  (18)  starting  from  any 
£(0)  enters  the  set  -r-oo  in  a  finite  time  and  remains  in  it  thereafter.  This  can  be  done  in  two 
steps.  In  the  first  step,  we  show  that,  for  each  i  =  0  to  IV  -  1,  there  exits  a  k2i  >  0,  such  that 
for  all  k  >  k2i,  in  the  presence  of  any  d  satisfying  Assumption  2.3,  all  trajectories  starting  from 
6i  \  £i+i  will  remain  in  and  enter  into  the  inner  level  set  ei+i  in  a  finite  time.  This  in  turn 
implies  that,  for  any  k  >  maxj^o,  k2l,  ■■■ ,  k2N_  x},  all  the  trajectories  of  the  closed-loop  system 
starting  from  x  C  e0  will  enter  the  inner-most  level  set  eN  in  a  finite  time.  The  second  step  of 
the  proof  is  to  show  that  there  exists  a  k2N  >  0  such  that,  for  all  k  >  k2N,  all  the  trajectories  of 
the  closed-loop  system  starting  from  e n  will  remain  in  it  and  enter  and  remain  in  the  set  E^  in  a 
finite  time.  Once  these  two  steps  are  completed,  the  proof  of  Point  2  is  then  completed  by  taking 
k^D,  Eoo)  =  max{^0,^1,---,A:*JV}. 

We  start  by  considering  the  closed-loop  system  (18)  for  £  €  e,-  \  ei+1)  %  =  0  to  N, 

£  =  M  +  £[sat(— (A;  +  1  )R-lB'P£  +  Vw  +  d)~  Tu;] 

=  {A-  BR~lB' Pi)^  +  B[satA(-(/c  +  l)R-lB'P£  +  Yw  +  d)  +  R-lB'p£  -  Tu;]  (19) 
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where  en+i  =  0. 

Pick  the  Lyapunov  function, 

Vi  =  (Pit  (20) 

The  evaluation  of  Vi  along  the  trajectories  of  the  closed-loop  system  (19)  gives, 

Vi  =  -£'£  ~  (PiBR^B'Pit  +  2£'PiB[satA(-(k  +  ^R^B'P^  +  Tw  +  d)+  R^B'P^  -  Tw] 

m 

<  ~£'£  -  2  5Z  Vi[sa.tAi((k  +  l)«i  +  9i  +  di)  —  w*  —  &i] 

1=1 

where  we  have  denoted  the  ith  elements  of  v  =  — R~lB'Pi£ ,  Tw  and  d  respectively  as  Vi,  di  and 

di. 

By  the  construction  of  £j,  it  is  clear  that  |u;  +  0,j  <  Aj  for  all  £  G  e*.  Hence  we  have, 

Ifcujl  >  [dj|  =>  Vi[satAi((k  +  l)vj  +  6{  +  di)  —  v*  —  di)  >  Vi[satA].(uj  +  di)  -  Vi  -  d,]  =  0 

and, 

2|dj2 

\kvi\  <  \dx\  =»  t»i[satAi((fc  +  l)uj  +  dl  +  di)  —  V;  -  0j]  <  *— 

where  we  have  used  the  fact  that, 


|satAi(ui)  -satAj(v2)|  <  |t»i  -v2\,  Vui,v2  G  3R 

Hence,  we  can  conclude  that,  for  all  £  €  £i  \  £j+i,  i  =  0  to  N  —  1, 

4||d||2  ,  AD2 

Vi<-t't  +  ^JL<-t't  +  — 

To  complete  the  first  step  of  the  proof,  for  each  i  =  0  to  N  —  1,  we  let 

5Amax(Pi+i)-C)2 


k*(D)  = 


Pi+l 


We  then  have  that,  for  all  k  >  k*, 
Vi  < 


ThV)  r+1  *  j 


and  hence, 


(21) 


(22) 


V  <0,  Vx  G  Ei  \  Ei+ 1 


(23) 
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which,  together  with  the  fact  that  £i+1  is  strictly  inside  £*,  shows  that  all  the  trajectories  of  the 

closed-loop  system  starting  from  £*  \  £,+1  will  remain  in  £j  and  enter  the  level  set  £i+1  in  a  finite 
time. 


For  the  second  step  of  the  proof,  let  pN+1  e  (0 ,p/v]  be  such  that  e(PN,pN+1 )  c  £qo-  The 
existence  of  such  a  pN+1  is  due  to  the  fact  that  contains  the  origin  of  the  state  space  as  an 
interior  point.  Choose  k%N(D,  E^)  as  follows, 

u*  i  n  ^  i _  5Amax(P/v)Z?2 

KN\V1-  ooj - - - 

PN+l 

We  will  also  have  that,  for  all  k  >  k$N, 


VN  < 
and  hence, 


1 

^max  (Pn) 


4A  max(PN)D2\ 

k  J 


(24) 


Vjv  <0,  V£  £  £at  \  e(PN,  pn+i)  (25) 

which  shows  that  all  trajectories  of  the  closed-loop  system  starting  from  eN  will  remain  in  eN  and 
enter  the  set  e(Pn ,Pn+ i)  C  —  oo  in  a  finite  time  and  remain  in  it  thereafter. 

So  far,  we  have  shown  that  any  trajectory  of  the  closed- loop  system  (19)  starting  from  the  set 
“  enter  the  set  r.00  in  a  finite  time  and  remain  in  it  thereafter.  Noting  Condition  1  of  the 
theorem  implies  that  e  =  C£,  and  recalling  that  £  6  implies  that  e  e  £«,,  which  concludes 
the  proof  of  Theorem  4.1.  n 


5  Robust  IVacking  via  Output  Feedback 

In  this  section,  we  consider  the  case  that  only  the  output  is  available  for  feedback,  i.e.,  the  output 
feedback  robust  tracking  problem.  We  will  use  a  fast  observer  to  preserve  the  performance  of  the 
combined  PLC/LHG  state  feedback  laws  proposed  in  the  previous  section.  In  order  to  build  the 
fast  observer,  we  make  the  following  assumption, 

Assumption  5.1  The  pair  ( A,E )  is  observable. 

Our  output  feedback  result  is  given  as  follows. 

Theorem  5.2  Let  Assumptions  2. 1-2.3  and  5.1  hold,  and  given  the  data  (£>,W0O,£00)  admissible 
for  output  feedback,  then  Problem  2  is  solvable  if  there  exist  matrices  II  and  T  such  that 
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1.  they  satisfy  the  linear  matrix  equations, 

ns  =  in  +  BT 

(26) 

cn  +  q  =  o 

where  A  and  B  are  as  given  by  (10),  and  C  =  [C  OpXm]- 

2.  there  exists  a  5  =  [<5i ,  62,  •  ■  ■ ,  5m],  6{  >  0,  such  that  ]Pju;(t)|  <  A i  —  Si  for  all  w(0 )  G  W  and 
all  t  >  0,  where  Tj  is  the  ith  row  ofT. 

Moreover,  the  output  feedback  laws  that  solve  Problem  2  can  be  explicitly  constructed  by  imple¬ 
menting  the  state  feedback  law  of  the  previous  section  with  a  fast  observer. 

Proof:  We  will  prove  this  theorem  by  explicitly  constructing  a  family  of  fast  observer  based 
combined  PLC/LHG  output  feedback  laws,  parameterized  in  two  parameters  k  and  £  and  showing 
that,  there  exists  a  k*  >  0,  and  for  each  k  >  k*,  there  exists  an  £*{k)  >  0  such  that,  for  any 
t  >  i*(k),  k  >  k* ,  the  output  feedback  law  solves  Problem  2. 

Construction  of  Parameterized  Output  Feedback  Laws 

Step  1:  Fast  Observer  Design.  The  family  of  fast  observers,  parameterized  in  t  is  given  by, 
x  =  Ax  +  L{£)(y  —  Cx)  +  Bv,  £(0)  G  X  (27) 

where  L(£),  £  >  0  is  chosen  such  that 

\{A  +  L{£)C)  =  iA  (28) 

where  A  is  any  set  of  n  complex  numbers,  with  negative  real  parts  and  close  under  complex 
conjugation. 

Step  2:  State  Feedback  Design.  Carry  out  the  state  feedback  design  as  in  the  proof  of 
Theorem  4.  In  the  design,  instead  of  letting  S  =  {X  x  V)  U  (5  —  IIu;  :  x  G  X  x  V,  w  G  W},  let  5 
be  any  bounded  set  such  that  (X  x  V)  U  {x  -  Iky  :  x  G  X  x  V,  w  G  W}  is  strictly  inside  it.  Let 
the  resulting  state  feedback  laws  be  given  by, 

Ul  =  v  -  (k  +  1  )T-lR-lB’PiX  +  T-1[(fc  +  1  )B'R-1PiU  +  T]w, 

for  x  G  £i\ei+i,  i  =  0, 1,  •  •  •  ,N  —  1 

(29) 

UN  =  v  -  (k  +  l)T-lR-NlB'PNx  +  T~l[(k  +  VB'RJj'PftTL  +  T]w  for  x  £  eN 
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where 


£i  =  {i  e  Hn+m  :  x'PiX  <  Pi },  i  =  0, 1,  •  •  • ,  JV 

iS  the  sequence  of  strictly  nested  level  sets  and  k  >  0  is  a  design  parameter  to  be  specified  later. 

Step  3:  Output  Feedback  Laws.  The  final  output  feedback  laws  are  obtained  by  implementing 
the  state  feedback  law  as  obtained  in  Step  1  with  the  fast  observer  as  obtained  in  Step  2.  The 
resulting  output  feedback  law  is  given  by, 

'  ui  =  V  -  (k  +  1  )T-lR-lB'Pii  +  T-1[(fc  +  1  )B'R-lPin  +  T]w, 

for  £  E  £i\  e,+i,  i  =  0, 1,  ■  •  • ,  JV— 1 

u  ~  (30) 

.  uN  =  v  -  (k  +  \)T-lR-lB'PNi  +  T~l[{k  +  l)B'R-NlPNTl  +  ]>  for  £  e  eN 
where  x  =  {x\  v']' . 

We  next  proceed  to  show  that, 

Point  1.  there  exists  a  k{  >  0,  and  for  each  k  >  k\,  there  exists  an  il(k)  >  0  such  that,  for  all 
i  >  ^l(k),  k  >  kl,  Item  1  of  Problem  2  holds; 

Point  2.  there  exists  a  >  0,  and  for  each  k  >  k%,  there  exists  an  t^k)  >  0  such  that,  for  all 
£  >  £2 (k),  k  >  k-2,  Item  2  of  Problem  2  holds, 

from  which  the  results  of  Theorem  5.2  follows  with  k*  =  max{fc*,  fcj},  and  for  each  k  >  k*, 
£*(k)  =  max{£J(A:),^2(A:)}. 

Proof  of  Point  1. 

To  show  the  existence  of  Aq,  and  £\{k)  for  each  k  >  k*,  let  us  write  out  the  closed-loop  system, 
fx  =  Ax  +  Bsa.tA(-(k  +  1  )R-1B'Pix  +  {k  +  1  )R~1B'Pi^  +  d), 

<  x  G  Si  \ei+i,  f(0)  e  X  x  V  (31) 

J  =  (A-L(£)C)<j>,  cf(0)e  A 

where  x  =  [x  ,v  ]  ,  (j>  =  [0',O]'  =  [ x '  —  x^O]'.  Noting  that  A  —  L[£)C  is  Hurwitz  for  all  £  >  0 
and  hence  limt_>00^(t)  =  0,  we  need  only  to  show  that,  for  any  k>k\  and  £  >  £\{k ),  (x(t),v(t)) 
enters  x  Vqo  in  a  finite  time  and  remains  in  it  thereafter,  where  and  Voo  both  contain  the 
origin  of  as  an  interior  point  and  are  such  that  X^  x  Voo  x  *00  is  strictly  inside  the  set  Z^.  To  do 
this,  we  recall  that  X  x  V  is  strictly  inside  the  set  E.  Since  the  dynamics  of  x  is  a  linear  system 
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driven  by  a  bounded  input  whose  bound  is  independent  of  £,  it  follows  from  (28)  that  there  exist 
an  £\a  >  0  and  a  To  >  0  such  that  for  all  £>  t\a, 

x(T0)  =  (x(To),«(T0))  G  S,  x(T0)  =  (x(T0),v(T0))  G  S,  V(z(0),  v(0))  G  X  x  V 

Hence  (x(To),u(To))  G  £q.  We  next  show  that  there  exists  a  k*  >  0  and  for  each  k  >  k\  there 
exists  an  £*(k)  >  0  such  that,  for  any  l  >  £*(k),  k  >  k{,  the  trajectory  (x(t),v(t))  will  enter 
^oo  x  Voo  in  a  finite  time  and  remain  in  it  thereafter.  This  can  be  done  in  two  steps.  In  the 
first  step,  we  show  that,  for  each  i  =  0  to  N  —  1,  there  exists  a  k[{  and  for  each  k  >  there 
exists  an  £*xi (k)  >  0  such  that  for  all  i  >  £*i(k),  k  >  k all  trajectories  (x(t),v(t))  starting 
from  £{  \  £i+i  will  remain  in  and  enter  into  the  inner  level  set  el+x  in  a  finite  time.  This  in 
turn  implies  that,  for  any  £  >  max{^|0(A:),  £h{k),  ■  ■  ■ ,  ^Ar_1  (A:)},  k  >  max{fcj0, kxl,  ■  ■  • ,  klN_x}, 
all  trajectories  (x(t),v(t))  starting  from  eq  will  enter  the  inner-most  level  set  en  in  a  finite  time. 
The  second  step  of  the  proof  is  to  show  that  there  exists  a  k\N  >  0  and  for  each  k  >  k\N  there 
is  an  £*lN  >  £\a{k )  such  that,  for  all  i  >  k  >  k\N,  all  the  trajectories  (x(t),v(t))  starting 

from  em  will  enter  and  remain  in  the  set  x  in  a  finite  time.  Once  these  two  steps  are 
completed,  the  proof  of  Point  1  is  then  completed  by  taking  k*  =  max{A;j0,  kxl,  •  •  • ,  k^N}  and  for 
each  k  >  k{,  £\(k)  =  max{^0(fc),^1(A:),  •  •  •  ,£\N(k)}. 

We  start  by  considering  the  closed-loop  system  (31)  for  ( x,v )  G  £j  \  £*+ 1  and  t  >  To, 

J  x  =  Ax  +  BsaXj\{—{k  +  1  )R~lB'PiX  +  (k  +  1  )R~1B'Pi<j)  +  d), 

U  =  [A  -  L(£)C)<t>,  (f>{0)  G  X  '  J“  ' 

Pick  the  Lyapunov  function 

Vi(x)  =  xPli.  (33) 


Recalling  that 


'  x' 

1 

H 

1 _ 

.v . 

.  v  . 

we  have 


Vi(x)  =  x'PiX  —  2  x'  Pi4>  +  4>'Pi4> 


(34) 
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Now  for  any  k  >  0,  let  £*lb(k)  >  £\a  be  such  that,  for  any  i  >  £\b(k),  k  >  0,  and  for  all 
(x(0),u(0),x(0))  €  X  x  V  x  X, 

[\\{k  +  \)R-lB'P^{t)\\<D,  it  >  To 
\\x'Pi[A  -  L(e)C]4>(t) ||  <  Si,  V(x,v)  e  e0,  Vi  >  T0 
||^,(t)Pi[ix  +  JBsatA(-)]||  <  x»  v(^«)  G  eo,  Vi  >  T0 
1  ||^(i)Pi[i4  -  L(c)C]0(i)||  <  Si,  Vi  >  T0 

Let  d  =  (k  +  1  )R~lB'Pi4>(t)  +  d.  In  view  of  Assumption  2.3  and  (35),  we  have  that 


(35) 


||d||  <  2D,  it  >  T0 


(36) 


Viewing  d  as  disturbance  and  in  view  of  (35),  it  follows  from  the  same  arguments  as  used  for 
arriving  at  (21)  that 


Vi  <  —x'x  + 


22  D2 


<  - 


1 


'i'Pt-ni  -  ,vi  g  Ci  \  eM  (37) 


k  ~  Amax(Pi+1)  k  , 

For  each  k ,  let  tXi{ k )  >  £\b{k)  be  such  that,  for  all  t  >  To,  x  Pj+ \x  >  pi+ 1  implies  that  x'Pi+ \x  > 
hpi+i-  Also  let 


k*u(D)  = 


45Amax(Pi+1)P2 


Pi+ 1 


We  then  have  that,  for  all  £  >  £*u(k),  k  >  k*xi, 
Vi  <  0,  ix  E  Si  \  f-i 


(38) 


(39) 


which  implies  that  all  trajectories  (x(f),u(f))  of  the  closed-loop  system  starting  from  e*  \^+1  will 
remain  in  Ei  and  enter  the  inner  set  Sj+i  in  a  finite  time  and  remain  in  it  thereafter. 

For  the  second  step  of  the  proof  of  Point  1,  let  p/v+i  6  (0,  p^]  be  such  that  e{Pn, Pn+i)  C 
Voo  x  Vqq.  The  existence  of  such  a  p^+i  is  due  to  the  fact  that  X^  x  Vqo  contains  the  origin  of 
the  state  space  as  an  interior  point.  We  will  have, 

1  22An3ax(WD!\ 


VN<- 


xPtfX  — 


,  ix  E  E;v 


(40) 


,  ~  i  a/ j.  ru  a/  I 

Amax(Piv)  V  k  ) 

For  each  k  >  0,  let  i*N{k)  >  £\b{k)  be  such  that,  for  all  t  >  To,  x  Pjvx  >  pn+\  implies  that 
x'Pnx  >  Ipa'+i.  Also  let 

45A  max(P^)P2 


Kn{D) 


Pn+i 


(41) 
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We  then  have  that,  for  all  t  >  t\N{k),  k  >  k*N, 

V/v  <  0,  Vi  6  £n  \  z{Pn,  Pn+i)  (42) 

which  shows  that  all  trajectories  (f(f),v(t))  of  the  closed-loop  system  that  start  from  will 
remain  in  and  enter  the  set  e(.P/v,  Pn+i)  C  X^  x  in  a  finite  time  and  remain  in  it  thereafter. 
This  completes  the  proof  of  Point  1.  We  next  proceed  to  prove  Point  2. 

Proof  of  Point  2. 

Let  us  start  with  the  introduction  of  an  invertible,  triangular  coordinate  change  £  =  x  —  IIu>. 
Using  Condition  1  of  the  theorem,  we  can  write  the  closed-loop  system  as  follows, 

'  £  =  M  +  S[satA(— (A;  +  1  )R-1B'Pi^  +  (k  +  1  )R-lB'Pif>  +  Tw  +  d)  -  rto], 

<  x  E  £i\  £i+ 1,  i(0)  e  X  x  V  (43) 

J  =  {A  -  L{t)C)4>,  0(0)  e  x 

which,  except  for  the  terms  Tm,  is  exactly  the  same  as  (31).  The  rest  of  the  proof  follows  the 
arguments  used  in  the  proof  of  Point  2  of  Theorem  4.1  and  those  used  in  the  proof  of  Point  1  of 
the  current  theorem,  Theorem  5.2.  □ 


6  Design  for  Robust  Tracking  for  an  F-16  Aircraft 


In  this  section,  the  applicability  and  effectiveness  of  the  proposed  combined  PLC/LHG  design 
algorithm  is  demonstrated  with  an  F-16  fighter  aircraft  derivative.  At  the  flight  condition  corre¬ 
sponding  to  an  altitude  of  10,000  feet  and  a  Mach  number  of  0.7,  the  second  order  pitch  plane 
dynamics  (short  period)  of  this  aircraft  is  given  by  [16], 


"a" 

’  —1.1500 

0.9937' 

'a' 

A. 

3.7240 

-1.2600. 

.q. 

+ 


-0.1770 

-19.5000 


<5>  |a(0)|  <  O.lrad,  |g(0)|  <  0.5rad/sec 


8  =  sati(— TS  +  Tu  +  d),  |<5(0)|  <  O.lrad 

w ,  |u>i(0)|  <  0.5rad/sec,  i  =  1, 2 
+  [0  1  ]  w 


w 


(i  1 
-1  0 


e  —  [0 

i] 

a 

.q . 

y  =  [i 

o] 

’a' 

.q. 
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where  a,  q  and  S  are  respectively  the  trim  value  of  the  angle  of  attack,  pitch  rate  and  stabilizer 
deflection.  Here  the  actuator  is  rate  limited  to  ±1.0  rad/sec.  The  constants  T  =  20  sec,  which 
corresponds  to  an  actuator  bandwidth  of  20  rad/sec. 

It  is  straightforward  to  see  that  the  above  system  is  in  the  form  of  (1).  Our  goal  here  is  to 
design  an  effective  robust  tracking  control  law  that  will  cause  the  pitch  rate  q  to  track  a  class 
of  sinusoidal  reference  signals  in  the  presence  of  actuator  disturbance  dL.  The  control  law  to  be 
designed  is  the  combined  PLC/LHG  feedback  law  proposed  in  the  previous  two  sections. 

State  Feedback  Design 

In  the  case  that  both  states  a  and  q  are  available  for  feedback,  we  will  design  state  feedback  laws  as 
proposed  in  Section  4.  We  begin  by  checking  the  conditions  of  Theorem  4.1.  It  is  straightforward 
to  check  that  Condition  1  is  satisfied.  More  specifically,  the  matrices 
-0.4141  -0.4993 

0  -1 
-0.1304  -0.0307 

solve  the  linear  matrix  equations  (13).  It  is  also  straightforward  to  verify  that  \Tw(t)\  <  0.2  for 
all  ro(0)  €  W  and  for  all  t  >  0.  Hence,  Condition  2  of  the  theorem  is  satisfied  with  5  =  0.8. 
Following  the  design  procedure  given  in  Section  4,  we  get 

S  =  {(6,6,6)  :  161  <  0.5567,  |&|  <  1,  |6I  <  0.1805} 
and  find, 


‘  2.0409  1.0217  -10.2609 

Po  =  9.1920,  e0  =  26.8917,  P0  =  1.0217  0.7091  -5.7828 

.-10.2609  -5.7828  78.6731 

Choosing  N  =  5  and  A p  =  0.5,  we  obtain  a  feedback  law  of  the  form  (17)  with, 

'  1.5809  0.7792  -6.9023 

Pi  =  4.5960,  ex  =  15.0397,  Px  =  0.7792  0.5805  -4.0412 

.-6.9023  -4.0412  49.2153 

■  1.2568  0.6043  -4.7011 

p2  =  2.2980,  e2  =  8.4670,  P2  =  0.6043  0.4849  -2.8789 

.-4.7011  -2.8789  31.1964 
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Figure  1:  Combined  PLC/LHG  design  (N  =  5,  k  =  40).  (a(0),9(0),<5(0),uti(0),u;2(0))  = 

(-0.1, 0.5, 0.1, 0,0.5)  and  d  —  5sin(4f  +  3).  The  left  plot:  the  plant  output  Cx  (dashed  line) 
and  the  reference  signal  —  Qw  (solid  line).  The  right  plot:  the  actuator  rate  6. 


■  1.0252 

0.4753 

-3.2344' 

p3  =  1.1490,  e3  =  4.7942, 

^3  = 

0.4753 

0.4118 

-2.0852 

.-3.2344  - 

-2.0852 

20.0040. 

'  0.8580 

0.3784 

-2.2435' 

Pi  =  0.5745,  e4  =  2.7281, 

Pi  = 

0.3784 

0.3541 

-1.5314 

.-2.2435  - 

-1.5314 

12.9550. 

'  0.7363 

0.3045 

—  1.5666  - 

P5  =  0.2873.  e5  =  1.5590, 

P5  = 

0.3045 

0.3075 

-1.1376 

.-1.5666  - 

-1.1376 

8.4611. 

We  note  here  again  that  this  control  law  reduces  to 

an  LHG 

feedback 

to  a  PLC  law  if  k  =  0. 

Figs.  1  and  2  are  simulation  results.  In  these  simulations,  (a(0),g(0),<5(0),u;i(0),u>2(0))  = 
(-0.1, 0.5, 0.1, 0,0.5),  and  d  =  5sin(4f  +  3).  They  show  good  utilization  of  the  available  actuator 
rate  capacity,  and  that  the  degree  of  tracking  accuracy  in  the  presence  of  actuator  disturbance 
increases  as  the  value  of  the  parameter  k  increases. 
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Figure  2:  Combined  PLC/LHG  design  (N  =  5,  k  =  400).  (ck(0), g(0), 5(0), u/i (0), u^(0))  = 
(—0.1, 0.5, 0.1, 0, 0.5)  and  d  =  5sin(4t  +  3).  The  left  plot:  the  plant  output  Cx  (dashed  line)  and 
the  reference  signal  —Qw  (solid  line).  The  right  plot:  the  actuator  rate  5. 

We  next  compare  the  performance  of  the  combined  PLC/LHG  design  with  both  the  PLC  and 
the  LHG  design.  To  compare  with  the  PLC  laws,  we  set  k  -  0  in  (17).  The  simulation  (Fig.  3) 
shows  that  in  the  presence  of  the  same  disturbance,  the  aircraft  becomes  unstable.  To  compare 
with  the  LHG  feedback  law,  we  set  N  =  0  in  (17).  The  simulation  (Fig.  4)  shows  that  the 
combined  PLC/LHG  feedback  law  results  in  much  better  transient  performance.  By  increasing 
the  value  of  N,  the  transience  performance  can  be  further  improved. 

Output  Feedback  Design 

In  the  case  that  only  the  output  (i.e.,  a)  is  available  for  feedback,  we  design  output  feedback  laws 
as  proposed  in  Section  5.  We  have  already  checked  in  the  state  feedback  case  that  all  condition 
of  Theorem  5.2  are  satisfied  except  Assumption  5.1,  which  is  also  satisfied.  Following  Step  1  of 
the  output  feedback  design  algorithm,  we  obtain  the  fast  observer  as  given  by  (27)  with 

L(£)  =  [2* -  2.41  1.0063£2  —  2.536^  +  5.3216  ]' 

We  note  here  that,  with  this  choice  of  L(£), 

\(A-L(£)C)  =  {-e,-£} 
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Figure  3:  PLC  Design  {N  =  5,  k  =  0).  (a(O).g(O), (0),u^(0))  =  (-0.1, 0.5, 0.1, 0,0.5)  and 
d  =  5sin(4£  +  3).  The  left  plot:  states  of  the  plant  and  the  actuator.  The  right  plot:  the  actuator 


Time  (sec) 


Time  (sec) 


Figure  4:  Transience  performance:  PLC/LHG  (N  =  5,  k  =  40,  solid  lines)  vs.  LHG  (N  =  0,  k  = 
40,  dashed  lines).  (a(0),  g(0),  5(0),  w\  (0),  ^(0))  =  (—0.1, 0.5, 0.1, 0,0.5)  and  d  =  0.  The  left  plot: 
tracking  error  e.  The  right  plot:  the  actuator  rate  5. 


Proceeding  with  Step  2  of  the  design,  we  first  choose 


s  =  {(6,6,6)  :  161  <  0.56,  161  <  l.oi,  (61  <  0.185} 
and  find, 

'  2.0993  1.0521  -10.7036 ' 

po  =  9.8822,  €q  =  28.5865,  P0  =  1.0521  0.7250  -6.0107 

.-10.7036  -6.0107  82.6928. 

Choosing  N  =  5  and  A p  =  0.5,  we  obtain  a  state  feedback  law  of  the  form  (29)  with, 

■  1.6217  0.8010  -7.1897- 

Pi  =  4.9411,  ei  =  15.9758,  Pi  =  0.8010  0.5921  -4.1914 

.-7.1897  -4.1914  51.6538. 

'  1.2857  0.6201  —4.8911  ’ 

p2  =  2.4705,  e2  =  8.9881,  P2  =  0.6201  0.4937  -2.9803 

.-4.8911  -2.9803  32.6997. 

-  1.0460  0.4871  -3.3619- 

p3  =  1.2353,  e3  =  5.0865,  P3  =  0.4871  0.4186  -2.1552 

.-3.3619  -2.1552  20.9446. 

’  0.8731  0.3873  -2.3302' 

p4  =  0.6176,  e4  =  2.8930,  P4  =  0.3873  0.3596  -1.5807 

.-2.3302  -1.5807  13.5511. 

'  0.7473  0.3113  —1.6261“ 

p5  =  0.3088,  e5  =  1.6526,  P5  =  0.3113  0.3120  -1.1730 

.-1.6261  -1.1730  8.8436. 

Implementing  the  state  feedback  law  with  the  fast  observer  state,  both  obtained  as  above,  we 
have  the  output  feedback  law  in  the  form  of  (30). 

Some  simulation  results  are  shown  in  Fig.  5.  In  the  simulation,  (a(0),  g(0),  £(0),u>i  (0),m2(0))  = 
(—0.1, 0.5, 0.1, 0, 0.5),  d(t)  =  5sin(4t  +  3),  which  are  the  same  as  in  the  simulations  in  the  state 
feedback  case,  and  (fi(0),®2(0))  =  (0,0).  With  k  =  400,  l  =  1000,  these  simulations  results  are 
almost  the  same  as  those  shown  in  Fig.  2. 


S  -0.2 
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_  0 
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Time  (sec) 
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Figure  5:  PLC  Design  {N  =  5,  k  =  0).  (ot(0) ,  g(0) ,  5(0) , (0), (0) , ® i (0) , *2 (0) )  = 
(-0.1, 0.5, 0.1, 0, 0.5, 0, 0)  and  d  =  5  sin(4f  +  3)..  The  upper  left  plot:  the  plant  output  Cx  (dashed 
line)  and  the  reference  signal  —  Qw  (solid  line).  The  upper  right  plot:  the  actuator  rate  <5.  The 
lower  plots:  plant  states  x(t)  (solid  line)  and  their  estimates  x  (dashed  line). 
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7  Conclusions 


A  design  technique  we  recently  proposed  for  stabilization  of  a  linear  system  with  rate-limited 
actuators  is  utilized  to  design  feedback  laws  that  cause  the  system  output  to  track  a  desired  com¬ 
mand  signal.  This  design  method  is  based  on  two  design  techniques  recently  developed  for  linear 
systems  with  position  limited  actuators  and  retains  the  advantages  of  both  design  techniques, 
while  avoiding  their  disadvantages.  Application  of  the  proposed  design  to  an  F-16  fighter  aircraft 
demonstrates  the  applicability,  robustness  and  effectiveness  of  this  novel  control  law. 


References 

[1]  J.M.  Berg,  K.D.  Hammett,  C.A.  Schwartz,  and  S.S.  Banda,  “An  analysis  of  the  desta¬ 
bilizing  effect  of  daisy  chained  rate-limited  actuators,”  IEEE  Trans.  Control  Sys.  Tech., 
Vol.  4,  No.  2,  1996. 

[2]  D.S.  Bernstein  and  A.N.  Michel,  “A  chronological  bibliography  on  saturating  actuators,” 
International  Journal  of  Robust  and  Nonlinear  Control ,  Vol. 5,  No. 5,  pp. 375-380,  1995. 

[3]  M.A.  Dornheim,  “Report  pinpoints  factors  leading  to  YF-22  crash,”  Aviation  Week  Space 
Technol.,  pp.  53-54,  Nov.  9,  1992. 

[4]  B.  Francis,  The  linear  multivariable  regulator  problem,”  SIAM  J.  Control  Optimization, 
Vol.  15,  486-505,  1984. 

[5]  J.M.  Lenorovitz,  “Gripen  control  problems  resolved  through  in-flight,  ground  simula¬ 
tions,”  Aviation  Week  Space  Technol.,  pp.  74-75,  June  18,  1990. 

[6]  Z.  Lin,  “Semi-global  stabilization  of  linear  systems  with  position  and  rate  limited  actua¬ 
tors,”  Systems  &  Control  Letters,  Vol.  30,  No.  1,  pp.  1-11,  1997. 

[7]  Z.  Lin,  “Global  control  of  linear  systems  with  saturating  actuators,”  Proc.  of  the  35th 
CDC,  pp.  4357-4362,  1996. 

[8]  Z.  Lin,  M.  Pachter  and  S.  Banda,  “Toward  improvement  on  tracking  performance  -  non¬ 
linear  feedback  for  linear  systems,”  International  Journal  of  Control ,  to  appear.  Also  in 
the  Proceedings  of  the  2nd  Asian  Control  Conference,  Vol.  II,  pp.  527-530,  1997. 


15-28 


[9]  Z.  Lin,  M.  Pachter,  S.  Banda  and  Y.  Shamash,  “Stabilizing  feedback  design  for  linear 
systems  with  rate  limited  actuators,”  Control  of  Uncertain  Systems  with  Bounded  In¬ 
puts,  eds.  S.  Tarbouriech  and  Germain  Garcia,  Lecture  Notes  in  Control  and  Information 
Sciences,  Springer- Verlag,  Vol.  227,  pp.  173-186,  1997. 

[10]  Z.  Lin,  M.  Pachter,  S.  Banda,  and  Y.  Shamash,  “Feedback  design  for  robust  tracking  of 
linear  with  rate  limited  actuators,”  Proceedings  of  1997  AIAA  Guidance,  Navigation  and 
Control  Conference,  AIAA  paper  97-3609,  pp. 780-788,  1997. 

[11]  Z.  Lin,  M.  Pachter,  S.  Banda,  and  Y.  Shamash,  “Feedback  design  for  robust  tracking  of 
linear  with  rate  limited  actuators  (with  output  feedback),”  submitted  for  publication. 

[12]  Z.  Lin,  A.  Saberi  and  B.M.  Chen,  Linear  Systems  Toolbox,  (Commercially  available 
through  A.J.  Controls  Inc.  Seattle,  Washington.)  Washington  State  University  Report 
No.  EE/CS  0097,  1991. 

[13]  Z.  Lin,  A.  Stoorvogel  and  A.  Saberi,  “Output  regulation  for  linear  systems  subject  to 
input  saturation,”  Automatica,  Vol.  32,  No  1,  pp. 29-47,  1996. 

[14]  A.  Megretski,  “Output  feedback  stabilization  with  saturated  control:  making  the  input- 
output  map  Z/2-bounded,”  preprint.  See  also  “L2  BIBO  output  feedback  stabilization  with 
saturated  control,”  Proc.  13th  Triennial  World  Congress  of  IFAC,  vol.  D,  pp.  435-440, 
1996. 

[15]  P.  Miotto,  J.M.  Shewchun,  E.  Feron,  and  J.D.  Paduano,  “High  performance  bounded 
control  synthesis  with  application  to  the  F18  HARV.  Proc.  of  the  AIAA  GNC,  San  Diego, 
CA,  July  1996. 

[16]  M.  Pachter,  P.R.  Chandler  and  M.  Mears,  “Reconfigurable  tracking  control  with  satura¬ 
tion,”  AIAA  Journal  of  Guidance,  Control  and  Dynamics,  Vol.  18,  pp  1016-1022,  1995. 

[17]  C.A.  Shifrin,  “Gripen  likely  to  fly  again  soon,”  Aviation  Week  Space  Technol.,  pp.  72, 
Aug.  23,  1993. 

[18]  A.  Saberi,  Z.  Lin  and  A.R.  Teel,  “Control  of  linear  systems  subject  to  input  saturation,” 
IEEE  Trans.  Auto.  Contr.,  Vol.  41,  No.3,  pp.  368-378,  1996. 


15-29 


[19]  P.  Sannuti  and  A.  Saberi,  “A  special  coordinate  basis  of  multivariable  linear  systems  - 
finite  and  infinite  zero  structure,  squaring  down  and  decoupling,”  International  J.  Control, 
Vol.  45,  pp.  1655-1704,  1987. 

[20]  G.F.  Wredenhagen  and  P.R.  Belanger,  “Piecewise-linear  LQ  control  for  systems  with  input 
constraints,”  Automatica ,  vol.  30,  pp.  403-416,  1994. 


15-30 


EXPERIMENTAL  AND  COMPUTATIONAL  INVESTIGATIONS  OF  BROMINE  AND 
IODINE  CHEMISTRY  IN  FLAME  SUPPRESSION 

Paul  Marshall 
Associate  Professor 
Department  of  Chemistry 

University  of  North  Texas 
PO  Box  305070,  Denton,  Texas  76203-5070 


Final  Report  for: 

Summer  Faculty  Extension  Program 

Wright  Laboratory,  Materials  Directorate,  Wright-Patterson  AFB,  Ohio  45433 

Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  AFB,  Washington,  D.C. 

and 

Wright  Laboratory,  Wright-Patterson  AFB 

December  1997 


16-1 


EXPERIMENTAL  AND  COMPUTATIONAL  INVESTIGATIONS  OF  BROMINE  AND 
IODINE  CHEMISTRY  IN  FLAME  SUPPRESSION 


Paul  Marshall 
Associate  Professor 
Department  of  Chemistry 
University  of  North  Texas 
PO  Box  305070,  Denton,  Texas  76203-5070 


Abstract 

The  kinetics  of  the  reaction  of  hydroxyl  radicals  with  trifluoroiodo methane  were  investigated  by  the 
flash  photolysis  -  resonance  fluorescence  technique.  A  rate  constant  of  k  =  9.9  x  10'12  exp(-12.7  kJ 
mol  ’/RT)  cm3  molecule1  s'1  was  measured  over  the  temperature  range  280-450  K  with  accuracy 
limits  of  10%  (450  K)  to  30%  (280  K).  Different  product  channels  were  investigated  by  ab  initio 
methods,  and  the  dominant  products  are  CF3  +  HOI.  The  enthalpy  of  formation  of  hypoiodous  acid 
was  analyzed  with  Gaussian-2  theory,  in  conjunction  with  G2  energies  for  INO,  ICN,  C1CN  and  other 
species.  The  transition  state  and  reaction  coordinate  for  OH  +  CF3I  was  characterized  at  the 
G2(MP2)  level,  and  the  results  suggest  a  negligible  barrier  to  the  reverse  reaction  of  CF3  +  HOI,  so 
that  the  measured  forward  activation  energy  can  be  used  to  derive  AfH29g(HOI)  =  -68  ±  5  kJ  mol1. 
The  implications  of  the  kinetics  and  thermochemistry  for  iodine  chemistry  in  flames  and  the 
atmosphere  are  discussed,  and  for  the  range  280  -  2000  K  a  proposed  rate  expression  is  k  =  2.9  x 
10'16  (T/K)1 5  exp(-960  K/T)  cm3  molecule'1  s'1. 


16-2 


EXPERIMENTAL  AND  COMPUTATIONAL  INVESTIGATIONS  OF  BROMINE  AND 
IODINE  CHEMISTRY  IN  FLAME  SUPPRESSION 


Paul  Marshall 


1.  Introduction 

Trifluoroiodomethane  is  a  possible  replacement  for  Halon-1301  (CF3Br)  as  a  fire  suppression 
agent  in  some  applications,1  following  the  ban  on  Halon  production  under  the  Montreal  Protocol  on 
Substances  that  Deplete  the  Ozone  Layer.  A  potentially  important  step  in  the  combustion  chemistry 
of  CF3I  is  the  reaction 

OH  +  CF3I  -  CF3  +  HOI  (1) 

whose  kinetics  have  been  investigated  previously  only  at  room  temperature.2,3  The  two  earlier  studies 
yielded  rate  constants  kj  that  differ  by  a  factor  of  four.  One  aim  of  the  present  work  is  to  determine 
the  Arrhenius  parameters  of  the  isolated  elementary  reaction  for  the  first  time,  in  order  to  permit 
estimation  of  kj  at  flame  temperatures.  The  second  aim  is  to  measure  at  below  room  temperature, 
in  order  to  assess  the  atmospheric  lifetime  of  CF3I  at  the  average  tropospheric  temperature  of  around 
280  K.  The  third  aim  is  to  assess  the  main  products  of  the  reaction.  Finally,  the  kinetic  measurements 
are  combined  with  ab  initio  information  about  the  potential  energy  surface  to  obtain  the 
thermochemistry  of  a  dominant  product,  hypoiodous  acid.  HOI  is  a  major  product  of  the  reaction  of 
ground  state  atomic  oxygen  with  C2H5I  and  larger  iodoalkanes,4,5  and  recently  has  been  studied  by 
photoionization6  and  FT-IR  emission  techniques.7  AfH298(HOI)  has  not  previously  been  determined 
experimentally,  despite  its  importance  to  understanding  the  photodissociation  dynamics8  and 
photoionization6  of  hypohalous  acids  and  the  roles  of  HOI  in  the  mechanisms  of  iodine-mediated 
flame  suppression9  and  stratospheric  iodine  chemistry.10  Here  we  employ  an  ab  initio  potential  energy 
surface  (PES)  to  estimate  the  activation  energy  for  the  reverse  rate  constant  k  „  and  thus  obtain  the 
thermochemistry  of  reaction  1.  The  result  is  checked  against  results  from  Gaussian-2  (G2)  theory, 
which  for  first  and  second-row  elements  has  a  typical  accuracy  of  about  8  kJ  mol'1.11  As  will  be 
shown,  our  measured  and  computed  AfH298(HOI)  are  in  good  accord  but  lie  around  15  -20  kJ  mol'1 
below  some  recently  proposed  values,  and  thus  HOI  is  more  stable  than  previously  thought. 
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2.  Experimental  method 

The  gas-handling  system  and  the  flash  photolysis  -  resonance  fluorescence  reactor  are 
described  elsewhere.12,13'14  Briefly,  OH  radicals  were  generated  by  flash  lamp  photolysis  of  water 
vapor  through  magnesium  fluoride  optics  (X  >  120  nm),  in  the  presence  of  excess  CF3I  in  a  buffer  of 
Ar  gas.  [OH]  was  monitored  as  a  function  of  time  by  resonance  fluorescence  at  X  »  307  nm,  excited 
from  a  flow  of  2%  H20  vapor  in  Ar  through  a  microwave-powered  discharge  lamp,  and  isolated  with 
a  band  pass  filter  before  detection  by  a  photomultiplier  tube  with  photon-counting  electronics.  The 
signals  If  from  typically  100-500  pulses  were  averaged  before  analysis  by  non-linear  least-squares 
fitting  to  an  exponential  decay  plus  a  steady  background  B  from  scattered  resonance  radiation:15 

If  =  A  exp(-kpsl  t)  +  B  (2) 

where  the  pseudo-first-order  decay  coefficient  is 

^>si  =  kiefffCFjI]  +  kdiff.  (3) 

kdiff  describes  the  first-order  loss  of  OH  by  diffusion  to  the  walls  of  the  reactor  and  was  typically  50- 
200  s'1.  The  effective  second-order  rate  constant  kl  eff  was  obtained  from  the  slope  of  plots  of  typically 
four  to  six  kpSl  values  vs  [CF3I]  from  0  to  [CFjI]^.  The  glass  reactor  was  modified  from  that 
employed  to  investigate  OH  +  silane  reactions14  by  the  addition  of  heating  tape  to  reach  elevated 
temperatures,  up  to  about  450  K,  and  by  the  use  of  an  ice  bath  to  lower  the  temperature  to  about  280 
K.  The  gas  temperature  at  the  reactor  center  was  measured  at  the  beginning  and  end  of  each 
experimental  run  with  a  moveable  chromel-alumel  thermocouple,  calibrated  against  melting  ice  and 
boiling  water.  At  each  temperature  studied  various  experimental  parameters  were  varied  to  see  how 
they  influenced  kl  eff.  These  included  the  total  pressure  P  and  average  gas  velocity,  which  determine 
the  average  gas  residence  time  in  the  reactor  before  photolysis  x^,  and  the  concentration  of  H20  and 
the  flash  lamp  energy  F,  which  together  influence  the  initial  radical  concentrations  created 
photolytically. 

3.  Experimental  results 

Table  1  summarizes  43  measurements  of  kl  eff  at  five  temperatures.  The  value  of  kleff  was 
found  to  be  independent  of  xres,  which  indicates  that  thermal  decomposition  of  CF3I  was  negligible. 
kl  eff  was  also  independent  of  [H20]  and  thus  [OH]^,  which  implies  that  OH  did  not  react 
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significantly  with  itself  or  with  the  reaction  products,  kj  eff  did  vary  significantly  and  systematically 
with  the  flash  energy  F  which  probably  reflects  secondary  chemistry  of  OH  with  photolysis  fragments 
of  CF3I.  The  concentration  of  these  fragments  will  be  approximately  proportional  to  F,  so  the  primary 
reaction  was  isolated  from  interferences  by  linear  extrapolation  to  zero  flash  energy,  as  illustrated  in 
Fig.  1.  The  five  resulting  k,(T)  values  are  plotted  in  Arrhenius  form  in  Fig.  2  and  may  be  summarized 
over  T  =  280-450  K  as 

kt  =  (9.9  ±  2.2)  x  10'12  exp(-12.7  ±  0.6  kJ  mof'/RT)  cm3  molecule'1  s'1  (4) 

where  the  errors  are  ±lo  statistical  precisions  derived  from  a  weighted  fit  that  takes  into  account 
errors  in  both  kj  and  T.  Consideration  of  the  coupling  between  the  errors  in  the  Arrhenius  parameters 
through  the  covariance  leads  to  ±lc  precision  estimates  for  k,  from  7%  at  the  central  temperature 
of  about  350  K  to  10%  at  the  extremes.  We  allow  for  possible  systematic  errors  of  ±1  x  10'14  cm3 
molecule'1  s'1  in  the  measurements  or  extrapolations  to  arrive  at  an  overall  confidence  interval  for  k3 
of  about  ±27%  at  280  K  improving  to  ±10%  at  450  K. 

4.  Computational  approach 

The  general  principles  of  quantitative  ab  initio  molecular  orbital  theory  have  been  outlined 
elsewhere.16,17  The  calculations  were  carried  out  using  the  GAUSSIAN  92  and  94  codes18,19  on  SGI 
Power  Challenge,  Cray  C-916  and  Cray  YMP-8  computers.  Standard  atomic  basis  sets  were 
employed,  except  for  iodine  where  the  basis  sets  described  by  Glukhovtsev  et  al.20  were  used.  The 
geometries  of  the  reagent  and  product  molecules  were  first  optimized  at  the  HF/6-31G(d)  level  of 
theory.  Scans  of  the  PES  at  this  level  indicated  no  barrier  to  reaction  1  beyond  the  endothermicity. 
Next,  the  molecular  structures  were  refined  at  the  MP2=full/6-31G(d)  level,  which  includes  a  partial 
correction  for  electron  correlation  effects  applied  to  all  electrons.  The  optimized  geometries  are 
illustrated  in  Fig.  3.  At  this  level  a  barrier  to  reaction  1  was  located,  and  its  geometry  is  also  shown 
in  Fig.  3.  Vibrational  frequencies  were  calculated  at  each  of  these  stationary  points  on  the  PES,  and 
they  verify  that  a  true  TS  geometry  was  obtained,  with  a  single  complex  frequency  that  corresponds 
to  motion  along  the  reaction  coordinate.  The  ab  initio  HF  and  MP2  frequencies  were  scaled  by 
0.8929  and  0.9646,21  respectively,  to  approximate  the  fundamental  frequencies  (listed  in  Table  2).  As 
can  be  seen  from  Table  2,  there  is  good  agreement  with  the  observed  frequencies  of  the  reactants  and 
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products.5,27 

Single-point  high  level  calculations  were  then  carried  out  at  the  stationary  points  to 
characterize  their  energy,  and  these  data  are  listed  in  Table  3.  These  are  frozen-core,  all-electron 
calculations  without  effective  core  potentials.20  The  QCISD(T)/6-311G(d,p)  and  MP2/ 
6-311+G(3df,2p)  results,  based  on  the  extension  of  Gaussian-2  (G2)  theory11  to  Br  and  I  by 
Glukhovtsev  et  al.20,  were  combined  according  to  the  G2(MP2)  methodology,22  where 

E[G2(MP2)]  =  E[QCISD(T)/6-311G(d,p)]  +E[MP2/6-311+G(3df,2p)]  - 

E[MP2/6-3 1  lG(d,p)]  +  HLC  +  ZPE  (5) 

An  empirical  higher-level  correction  is  based  on  the  number  of  alpha  and  beta  valence  electrons:21 
HLC  =  -0.00019 -0.00481  rip  hartrees  (1  hartree  *  2625  kJ  mol1),  and  E[G2(MP2)]  approximates 
a  complete  QCISD(T)/6-31 1+G(3df,2p)  calculation.  The  zero-point  vibrational  energies  (ZPE)  are 
based  on  MP2  results.  Because  MP2  frequencies  for  the  TS  are  available  but  HF  frequencies  are  not, 
our  methodology  for  the  PES  is  a  combination  of  the  G2(MP2)22  and  G2(ZPE=MP2)21  methods.  It 
is  noted  that  calculation  of  relative  energies  on  the  PES  cancels  the  HLC  terms.  Finally,  several  small 
iodine-containing  species  (HOI,  INO  and  ICN)  and  C1CN  were  characterized  with  standard  G2 
theory,11  which  employs  ZPE  calculated  at  the  HF  level  and  requires  additional  MP4/6-31  lG(d,p), 
MP4/6-311+G(d,p)  and  MP4/6-311G(2df,p)  calculations  (Table  4). 

5.  Discussion 
5.1  Kinetics 

As  shown  on  Fig.  2,  the  present  measurements  for  k,  he  midway  between  the  two  earlier 
room-temperature  determinations.  Neither  is  obviously  flawed,  although  the  authors  of  the  discharge 
-  fast  flow  study3  (smaller  kj  value)  suggested  that  the  laser  photolysis  -  resonance  fluorescence  study2 
(larger  k,  value)  may  have  been  affected  by  unrecognized  secondary  chemistry  that  consumed  OH, 
even  though  the  latter  authors  sought  such  effects  by  variation  of  the  laser  intensity  but  did  not 
observe  systematic  changes  in  kt.  The  present  data  yield  the  first  Arrhenius  parameters  for  reaction 
1,  and  the  A  factor  found  here,  around  10'11  cm3  molecule1  s'1,  implies  a  fairly  loose  TS  and  is  typical 
of  A  factors  found  for  H  abstraction  reactions  by  OH  from,  for  example,  H2  or  CH4.23 

The  rate  constant  at  the  mean  tropospheric  temperature  of  277  K,  estimated  from  eq.  4  to  be 
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4.0  x  10'14  cm3  molecule1  s'1,  is  about  5  times  that  for  OH  +  CH3CC13,  and  thus  the  tropospheric 
lifetime  of  CF3I  with  respect  to  OH  attack  is  about  0.2  of  that  of  CH3CC13,  i.e.  about  1.5  years.24  This 
result  may  be  compared  to  a  photolytic  lifetime  of  less  than  two  days25  which  indicates  that  the  major 
loss  pathway  for  CF3I  in  the  troposphere  is  via  photolysis. 

Equation  4  could  be  extrapolated  to  combustion  temperatures,  but  frequently  it  is  found  that 
rate  expressions  of  the  form  A  T°  exp(-B/T)  are  more  realistic  than  simple  Arrhenius  expressions  over 
wide  temperature  ranges.  We  have  estimated  n  =  1.5  for  reaction  1  from  the  vibrational  frequencies 
of  the  TS  and  reactants,  as  outlined  by  Cohen,26  and  a  fit  of  A  and  B  to  our  data  yields  the  following 
rate  expression  which  is  proposed  for  280-2000  K: 

kj  =  2.9  x  1016  (T/K)1’5  exp(-960  K/T)  cm3  molecule'1  s'1  (6) 

This  expression  implies  kt  ~  10'11  cm3  molecule'1  s'1  at  1500  K  so  that  attack  by  OH  in  a  flame  is 
rapid,  and  reaction  1  might  therefore  be  a  significant  destruction  pathway  for  CF3I  under  combustion 
conditions. 

5.2  Product  channels 

The  products  shown  for  reaction  1,  CF3  +  HOI,  are  those  that  would  be  formed  by  I-atom 
abstraction  by  OH.  The  use  of  reaction  1  by  Monks  et  aL6  as  a  source  of  HOI  demonstrates  that  HOI 
production  is  a  significant  pathway,  but  other  thermochemically  plausible  product  channels  might  also 
be  important.  An  alternative  pathway  is  substitution 

CF3I  +  OH  -  CF3OH  + 1  (lb) 

which  is  exothermic  by  252  kJ  mol1  at  298  K.2728  We  were  not  able  to  find  a  TS  for  this  reaction  with 
attack  by  OH  at  the  opposite  side  of  CF3I  from  the  C-I  bond,  but  did  locate  a  TS  for  attack  at  the 
same  face  of  CF3I.  Thus  if  substitution  occurs  it  appears  to  preserve  the  CF3  group  without  inversion. 
The  HF/3-21G(d)  TS  structure  has  greatly  extended  C-I  and  C-0  bonds,  with  lengths  of  3.20  and 
2.84  x  10"10  m,  respectively.  The  MP4/6-31G(d)//HF/3-21G(d)  energy  is  202  kJ  mo!1  above  CF3I  + 
OH,  so  that  channel  lb  has  too  high  a  barrier  to  contribute  to  the  observed  kinetics.  As  a  check  on 
the  reliability  of  this  barrier,  the  MP4/6-31G(d)//HF/3-21G(d)  energy  of  the  TS  for  reaction  1  was 
computed,  and  found  to  be  56  kJ  mol'1  above  CF3I  +  OH.  As  discussed  in  the  next  section,  this  is 
about  70  kJ  mol1  too  positive  by  comparison  with  more  computationally  intensive  G2(MP2)  results. 
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If  a  similar  correction  were  applied  to  reaction  lb  its  barrier  would  still  be  very  high,  around  130  kJ 
mol1. 

Ruscic  and  Berkowitz29  noted  that  a  possible  product  channel  is 

CF3I  +  OH  -  CF30  +  HI  (1c) 

which  is  exothermic  by  59  kJ  mol'1  at  298  K.27'28  We  were  not  able  to  find  a  TS  for  this  reaction, 
although  this  does  not  prove  that  channel  lc  does  not  occur.  However,  the  maximum  MP4/6-31G(d) 
energy  along  a  linear  synchronous  path  connecting  reactants  and  products  is  574  kJ  mol"1  above  the 
reactants.  This  is  an  upper  limit  to  the  barrier  to  this  reaction  if  a  TS  exists,  but  suggests  that  channel 
lc  will  be  extremely  slow. 

Wine  and  McKee  and  coworkers  have  investigated  adduct  formation  between  halogenated 
methanes  and  halogen  atoms  and  hydroxyl  radicals.  They  observed  chlorine  atom  adduct  formation 
at  low  temperatures  (around  250  K)  and  characterized  adducts  computationally  using  density 
functional  theory.30  We  have  analyzed 

CF3I  +  OH  (+  M)  -  CF3IOH  (+  M)  (Id) 

at  the  B3LYP/6-31G(d)  level  of  theory  and  located  an  adduct  with  an  1-0  distance  of  2.57  x  1010  m, 
bound  by  15  kJ  mol 1  at  0  K  (including  ZPE).  MP2=fiill/6-31G(d)  calculations  did  not  reveal  an 
adduct,  and  the  G2(MP2)  energy  at  the  B3LYP/6-31G(d)  geometry  (Table  3)  lies  1.5  kJ  mol1  above 
CF3I  +  OH.  Therefore  any  adduct  is  not  stable  enough  to  be  a  significant  sink  for  OH  under  our 
experimental  conditions. 

In  summary,  channels  lb-ld  appear  to  be  negligible  under  our  conditions  and  the  dominant 
products  of  reaction  1  are  CF3  +  HOI,  as  assumed  in  the  previous  experimental  kinetic  studies.2  3 

5.3  Experimental  thermochemistry  and  ab  initio  PES 

The  observation  that  the  reaction  OH  + 12  -  I  +  HOI  proceeds  at  the  collision  rate  at  298  K 
implies  this  process  cannot  be  endothermic,31  and  therefore  the  bond  dissociation  enthalpy  D298(HO-I) 
must  be  at  least  D298(I-I)  =  151  kJ  mol1,  which  combined  with  the  known  heats  of  formation27  of  I 
and  OH  implies  AfH298(HOI)  <  -5  kJ  mol1.  Jenkin  et  al.  analyzed  earlier  thermodynamic  data32  33  to 
obtain  -88  and  -82  kJ  mol"1  for  this  quantity.31  Based  on  the  idea  that  both  reaction  1  and  the  process 
HONO  +  HOI  -  H20  +  IN02  are  rapid,  Jenkin  et  al.31  deduced  that  -77  kJ  mol"1  £  AfH298(HOI)  >  - 
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102  kJ  mol'1.  The  upper  bound  rests  on  the  CF3-I  bond  strength  which  Asher  and  Ruscic  recently 
determined  to  high  precision,  D29g(CF3-I)  =  227.2  ±  1.3  kJ  mol'1.28  This  is  more  positive  than  the 
value  of 223  kJ  mol'1  available  to  Jenkin  et  aL,31  and  therefore  raises  the  upper  bound  to  AfH298(HOI). 

We  can  use  our  measured  activation  energy  for  reaction  1  to  improve  the  estimates  of 
AfH^OHOI),  based  on  the  concept  that  the  enthalpy  change  for  a  reaction  is  equal  to  the  difference 
between  the  forward  and  reverse  activation  energies: 

AH  =  E^-Em  <7) 

Using  the  CF3-I  bond  strength  at  350  K  (the  bond  strength  varies  by  less  than  0.1  kJ  mol'1  between 
298  and  350  K),27  a  temperature  that  corresponds  to  the  central  1/T  value  of  the  experiments,  it  is 
tempting  to  assume  that  the  activation  energy  for  the  reverse  reaction,  -1,  is  given  by  >  0  and 
thus  D350(HO-I)  must  be  at  least  214.5  kJ  mol'1.  However  the  reverse  reaction  could  conceivably  have 
a  negative  activation  energy,  perhaps  as  negative  as  -5  kJ  mol'1,  and  so  we  more  conservatively  derive 
D350(HO-I)  sr  209.5  kJ  mol'1  from  experimental  data  alone.  This  result  contradicts  an  empirical 
estimate  by  Zhang  et  aL34  that  D0(HO-I)  =  189  ±  2.5  kJ  mol1  with  an  associated  AjH^HOI)  =  -42.7 
±  2.5  kJ  mol'1  (which  implies  AfH298(HOI)  =  -46  kJ  mol1).  Our  result  also  contradicts  AfHy(HOI)  ~ 
-36  kJ  mol1  estimated  empirically  by  Ruscic  and  Berko witz.29 

Next  we  consider  the  PES  for  reaction  -1  in  more  detail,  using  the  G2(MP2)  ab  initio  results 
(Table  3).  These  yield  the  enthalpy  of  the  TS  relative  to  CF3  +  HOI  at  0  K,  Eq  .j*  =  -13  kJ  mol1.  This 
negative  value  is  consistent  with,  but  does  not  prove,  that  there  is  no  barrier  to  the  reverse  of  reaction 
1.  This  idea  is  tested  in  more  detail  on  Fig.  4,  where  energies  obtained  at  an  approximate  QCISD/6- 
31 1+G(3df,2p)  level35  are  plotted  as  a  function  of  the  C-I  distance  along  the  reaction  coordinate.36 
It  may  be  seen  there  is  no  barrier  to  reaction  -1,  a  result  also  derived  from  the  low-level  HF/6-31G(d) 
calculations  discussed  earlier  but  now  confirmed  at  a  much  higher  level  of  theory. 

This  conclusion  implies  an  approximately  zero  activation  energy  for  -1,  and  we  take  E^  = 
0  ±  5  kJ  mol1.  Since  the  enthalpies  of  formation  of  OH  and  I  are  well  known,27  eq.  8  is  employed  to 
estimate  AfH350(HOI)  =  -69  ±  5  kJ  mol'1,  where  the  main  source  of  uncertainty  is  Ea.,  and  the 
uncertainty  in  Evl  is  negligible  by  comparison.  The  ab  initio  moments  of  inertia  (Fig.  3)  and 
frequencies  (Table  2)  are  used  to  calculate  the  enthalpy  function,  HT-H„,  which  in  combination  with 
values  for  the  elements  in  their  reference  states27  yields  AfH(HOI)  at  298  and  0  K  of  -68  ±  5  and  -63 
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±  5  kJ  mol'1,  respectively.  We  have  calculated  S,  Cp  and  Hj.-H298  from  50  to  5000  K  and  fit  the  results 
with  standard  NASA  polynomials37  to  obtain  thermochemical  data  in  a  format  suitable  for  combustion 
and  atmospheric  models  (see  Table  5).  The  corresponding  O-I  bond  dissociation  enthalpy  is  D298(HO- 
I)  =  214  ±  5  kJ  mol1.  The  measured  AfH298(IO)  =  1 15.9  ±  5.0  kJ  mol1  from  Bedjanian  et  al.38  yields 
D298(H-OI)  =  402  ±  7  kJ  mol1,  comparable  to  the  C-H  bond  strength  in  ethane.39  Thus  H-abstraction 
by  10  from  hydrocarbons  is  thermodynamically  plausible  and  may  need  to  be  taken  into  account  in 
models  of  iodine-mediated  flame  suppression. 

5.4  Ab  initio  thermochemistry 

The  proposed  thermochemistry  of  HOI  is  in  reasonable  accord  with  the  earlier  experimental 
limit  values  outlined  above,  but  there  are  no  measurements  with  which  to  compare  the  new  result. 
The  recent  implementation  of  G2  theory  for  iodine  compounds  by  Glukhovtsev  et  al.20  enables  an 
independent  check  of  AjH^HOI).  The  G2  energy  of  HOI  (Table  4),  together  with  values  for  the 
atoms,  ,2  yields  AH0  for  atomization.  However,  this  process  involves  considerable  changes  in  the 
electronic  structures  of  the  atoms  which  might  not  be  well  accounted  for,  so  we  have  also  considered 
7  other  processes,  listed  in  Table  6,  which  relate  the  unknown  AfH0(HOI)  to  known  values  for  other 
small  molecules.  The  computed  AH,,  values  derived  from  the  G2  energies14'40  (see  Table  4)  for  these 
reactions  together  with  experimental  values  for  AfHo  of  H,  O,  I,  H20,  HI,  H2S,  C1CN,  ICN,  HOC1, 
HC1,  HNO,  INO,  CH3r*  and  HSf4  lead  to  the  AtH0(HOI)  values  listed  in  Table  6.  The  mean  of 
the  8  values  is  -60  kJ  mol 1  (standard  deviation  6  kJ  mol1)  which  is  in  good  accord  with  our  new 
experimental  value  of  -63  ±  5  kJ  mol'1. 

This  computed  AfH0(HOI)  is  15  kJ  mol 1  more  negative  than  the  value  recently  obtained  by 
Glukhovtsev  et  al. 5  also  using  G2  theory.  The  difference  appears  to  arise  about  equally  from  their 
use  of  effective  core  potentials,  by  contrast  to  the  all-electron  basis  sets  employed  here,  and  from  their 
consideration  of  only  the  first  working  reaction  of  Table  5.  This  atomization  step  gives  the  most 
positive  AfHofHOI)  of  all  the  processes  analyzed,  but  we  consider  it  the  least  reliable.  Very  recently 
Hassanzadeh  and  Irikura40  used  different  working  reactions  and  CCSD(T)  theory  to  obtain  AfH(HOI) 
at  0  and  298  K  of -57.0  and  -61.7  ±  7.0  kJ  mol'1,  respectively,  in  excellent  accord  with  the  computed 
and  experimentally-based  values  presented  here. 
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6.  Conclusions 

The  temperature  dependence  of  the  rate  constant  for  the  reaction  OH  +  CF3I  has  been 
measured,  and  ab  initio  analysis  suggests  the  dominant  products  are  CF3  and  HOI.  The  measured 
Arrhenius  parameters,  together  with  an  estimated  activation  energy  E^.,  for  the  reverse  reaction 
supported  by  an  ab  initio  analysis  of  the  potential  energy  surface,  yield  the  heat  of  formation  of  HOI. 
The  main  uncertainty  in  the  present  work  lies  in  E^.,.  This  experimental  value  for  AfH(HOI)  is 
confirmed  by  a  direct  computational  estimate,  and  is  more  negative  than  recent  empirical  values.  The 
temperature  dependence  of  the  rate  constant  suggests  that  OH  attack  is  a  minor  contributor  to  the 
loss  of  CF3I  in  the  troposphere,  but  a  potentially  significant  channel  for  CF3I  destruction  under 
combustion  conditions. 
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Table  1.  Summary  of  experimental  measurements  for  the  reaction  CF3I  +  OH. 


T,  K 

P,  mbar 

ties’  S 

F,  J 

[H20], 

1015  cm-3 

[CF^U 

1014cm'3 

k;l,eff±CJkl,eff  ’ 

10"13  molecule1 

cm3  s"1 

281 

68.2 

1.6 

5.00 

3.8 

12.1 

1.96  ±0.14 

281 

54.1 

1.0 

5.00 

2.3 

8.8 

1.74  ±0.04 

281 

89.8 

1.7 

5.00 

3.1 

10.9 

1.67  ±0.06 

281 

68.2 

1.6 

4.05 

3.8 

12.1 

1.71  ±0.10 

281 

54.1 

1.0 

4.05 

2.3 

8.8 

1.52  ±0.05 

281 

89.8 

1.7 

4.05 

3.1 

10.9 

1.29  ±0.08 

281 

68.2 

1.6 

2.50 

3.8 

10.0 

1.25  ±0.09 

281 

54.1 

1.0 

2.50 

2.3 

8.8 

1.01  ±0.06 

281 

89.8 

1.7 

2.50 

3.1 

10.9 

1.07  ±0.06 

281 

0.43  ±  0.0 la 

292 

75.2 

1.7 

5.00 

2.8 

17.8 

1.39  ±0.10 

292 

75.3 

1.7 

5.00 

2.2 

13.9 

2.18  ±0.15 

292 

71.0 

1.6 

5.00 

4.0 

12.7 

2. 10  ±0.09 

292 

63.9 

1.4 

5.00 

3.3 

15.6 

1.59  ±0.13 

292 

68.0 

1.5 

4.05 

2.6 

22.1 

1.48  ±0.06 

292 

59.1 

1.0 

4.05 

1.7 

16.9 

1.44  ±0.05 

292 

71.1 

1.6 

4.05 

2.8 

14.3 

1.88  ±0.13 

292 

72.1 

1.6 

4.05 

4.7 

13.5 

1.90  ±0.12 

292 

75.2 

1.7 

2.50 

2.8 

17.9 

1.26  ±0.07 
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443 

89.0 

1.8 

4.05 

3.4 

8.1 

4.34  ±0.17 

442 

59.5 

1.2 

4.05 

2.4 

7.0 

3.94  ±0.18 

445 

92.0 

1.3 

2.50 

3.0 

2.0 

4.48  ±0.17 

443 

89.0 

1.8 

2.50 

3.4 

6.0 

3.92  ±0.23 

442 

59.5 

1.2 

2.50 

2.4 

7.0 

3.64  ±0.19 

443 _ 3.38  ±0.41* 

‘linear  extrapolation  of  keff  to  zero  flash  energy  for  each  temperature  and  lc  uncertainty  of  the 

intercept. 
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Table  2.  Fundamental  vibrational  frequencies  v0  of  stationary  points  on  the  CF3I  +  OH  PES  (cm-1),  calculated  at  the  HF  and  MP2  levels 
of  theory  with  the  6-31G(d)  basis  set.* 
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“HF  values  scaled  by  0.8929  and  MP2  values  scaled  by  0.9646.  Doubly  degenerate  modes  are  indicated  by  (2). 


Table  3.  G2(MP2)  energies  for  stationary  points  on  the  OH  +  CF3I PES,  in  hartrees. 


Species 

QCISD(T)/ 

6-311G(d,p) 

MP2/6-311 

+G(3df,2p) 

MP2/ 

6-311G(d,p) 

HLC 

ZPE 

G2(MP2) 

OH 

-75.58921 

-75.61742 

-75.57276 

-0.01519 

0.00822 

-75.64084 

CF3I 

-7253.96099 

-7254.18589 

-7253.92435 

-0.08000 

0.01378 

-7254.28875 

cf3i-oh*  ts 

-7329.53285 

-7329.79873 

-7329.47649 

-0.09519 

0.02554 

-7329.92474 

HOI 

-6992.56705 

-6992.65970 

-6992.54346 

-0.03500 

0.01197 

-6992.70632 

cf3 

-336.96752 

-337.14009 

-336.94227 

-0.06019 

0.01203 

-337.21350 

CF3IOH  adduct 

-7329.54763 

-7329.80225 

-7329.49104 

-0.09519 

0.02503b 

-7329.92900 

•Based  on  scaled  MP2  frequencies  and  HLC  =  -0.00019  n*  -  0.00481  n^.  1  hartree  ~  2625  kJ  mol'1. 
bBased  on  unsealed  B3LYP/6-3  lG(d)  frequencies. 


Table  4.  G2  energies  of  ancillary  molecules  used  to  estimate  AfH0(HOT)  * 


Species  MP4/6-311G(d.p)  ZPE  AE(+)  AEQdf)  AE(OCD  A  E»(G2) 
HOI  -6992.56772  0.01215  -0.01045  -0.09163  0.00067  -0.02483  -6992.71681 

INO  -7046.58764  0.00591  -0.00725  -0.11020  0.00943  -0.02678  -7046.76153 

ICN  -7009.55409  0.00784  -0.00443  -0.09583  0.00477  -0.02382  -7009.70557 

C1CN _ -552.27617  0.00873  -0.00671  -0.10834  0.00466  -0.00687  -552.42469 


Tn  hartrees. 
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Table  5.  Thermochemistry  of  HOI  and  NASA  polynomials. 


AfHoCHOI)  =  -63  ±  5  kJ  mol'1  AfH298(HOI)  =  -68  ±  5  kJ  mol'1 

Cp.298  =  38.5  J  K'1  mol'1  S298  =  255.4  J  K 1  mol'1  H298-Ho  =  10.4  kJ  mol'1 

C/R  =  aL  +  a,T  +  a3T2  +  a4T3  +  a5T* 

(AfH298  +  Ht-H298)/RT  =  aL  +  a,T/2  +  a3T2/3  +  a4T3/4  +  %V/5  +  a /T 
S/R  =  aL  In  T  +  aj  +  a3T2/2  +  a4T3/3  +  a5T74  +  a7 

0.263 54949E+01  0 . 5293 4503E- 02-0 . 25414167E-05  0 . 53307325E-09-0 .40591408E-13 
-0 .89380615E+04  0 . 14730852E+02  0 .39206498E+01  0 . 67459485E-04  0 . 12938654E-04 
-0 . 20159172E-07  0 . 91124192E-11-0 . 94277812E+04  0 .79484367E+01* 


‘First  seven  numbers  are  at  -  a7  for  1000-5000  K,  followed  by  a!  -  a7  for  50-1000  K. 
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Table  6.  G2  thermochemistry  of  working  reactions  used  to  calculate  AfH^HOI),  kJ  mol'1. 


Reaction 

AH,, 

AA.CHOI  )a 

HOI  -  H  +  0  + 1 

623.6 

-53.6 

HOI  +  H2  -  H20  +  HI 

-150.4 

-60.0 

HOI  +  HC1  -  HOC1  +  HI 

105.0 

-55.8 

HOI  +  HI  -  H20  + 12 

-140.4 

-61.6 

HOI  +  C1CN  -  ICN  +  HOC1 

71.8 

-54.0 

HOI  +  HNO  -  INO  +  H20 

-161.1 

-62.7 

HOI  +  H2S  -  HSI  +  H20 

-103.0 

-73.0 

HOI  +  CH4  -  CH3I  +  H20 

-90.2 

-56.8 

Figure  captions 


Fig.  1  Extrapolations  of  the  effective  rate  constant  for  OH  +  CF3I,  kl  e£f,  to  zero  radical  concentration 
(flash  lamp  energy  F  =  0)  at  each  temperature.  The  points  represent  weighted  means  of 
measurements  at  the  same  F. 

Fig.  2  Arrhenius  plot  for  OH  +  CF3I.  (•)  present  work;  (O)  ref.  2;  (□)  ref.  3.  Error  bars  represent  ±lo 
precision. 

Fig.  3  Geometries  of  reactants,  transition  state  and  products  for  the  OH  +  CF3I PES.  Angles  in  degrees 
and  bond  lengths  in  10'10  m.  MP2=full/6-31G(d)  theory  employed,  with  HF/6-31G(d)  values  in 
parentheses.  CF3  and  CF3I  have  Qv  symmetry.  Dihedral  angles  in  the  TS:  CIOH,  81.9°;  FCIO  (long 
C-F  bond),  175.6°;  FCIO  (short  C-F  bonds),  56.7°  and  -65.9°. 

Fig.  4  Relative  energies  along  the  reaction  coordinate  for  OH  +  CF3I  -  HOI  +  CF3  computed  at  (O) 
MP2=full/6-31G(d)  and  (■)  approximate  (see  footnote  36)  QCISD/6-311+G(3df,2p)  levels  of 
theory.  The  energy  zero  is  set  at  HOI  +  CF3,  where  r(C-I)  =  °°.  r(C-I)  =  2.17  x  10'10  m  in  the 
reactants  and  2.53  x  1010  m  at  the  MP2=full/6-31G(d)  TS. 


16-23 


16-24 


T/K 


450  400  350  300 


16-25 


F 


1.335 

(1.308) 


1 10.3C 


.  (110.4°) 

& 


2.169 

(2.157) 


0.979 

(0.959) 


H 

103.5°  / 

(107.0°)  /  0979 

(0.959) 

I - -O 

2.038 

(1.987) 


16-26 


rgy  /  kJ  mol 


16-27 


Hui  Meng 

Report  not  available  at  time  of  publication. 


17-1 


BAND-GAP  CALCULATIONS  ON  SQUARATE- CONTAINING  CONJUGATED  OLIGOMERS 
FOR  THE  PREDICTION  OF  CONDUCTIVE  AND  NON-LINEAR 
OPTICAL  PROPERTIES  OF  POLYMERIC  MATERIALS 


Douglas  J.  Miller,  PhD 
Professor  of  Chemistry 
Science  &  Mathematics  Department 


Cedarville  College 
PO  Box  601 

Cedarville,  OH  45314-0601 


Final  Report  for: 

Summer  Research  Extension  Program 
Wright  Laboratory /Materials  Directorate 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  AFB,  Washington  DC 

and 


Wright  Laboratory/Materials  Directorate 


March  1998 


18-1 


BAND-GAP  CALCULATIONS  ON  SQUARATE - CONTAINING  CONJUGATED  OLIGOMERS 
FOR  THE  PREDICTION  OF  CONDUCTIVE  AND  NON-LINEAR 
OPTICAL  PROPERTIES  OF  POLYMERIC  MATERIALS 

Douglas  J.  Miller,  PhD 
Professor  of  Chemistry 
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ABSTRACT 

Attempts  were  made  to  calculate  the  band  gaps  of  conjugated  oligomers 
containing  squarate  moieties  and  f ive-membered  rings  which  incorporated 
nitrogen  atoms  (pyrrole- type  structure) .  Computational  software  (Gaussian, 
GAMESS,  and  HyperChem)  was  utilized  in  this  work.  Extrapolation  of  the 
results  for  the  oligomers  to  the  polymeric  material  was  made  in  order  to 
ascertain  whether  it  might  possess  desired  conductive  and  nonlinear  optical 
properties.  (Figures  and  tables  are  placed  after  the  footnotes  page.) 
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BAND-GAP  CALCULATIONS  ON  SQUARATE -CONTAINING  CONJUGATED  OLIGOMERS 
FOR  THE  PREDICTION  OF  CONDUCTIVE  AND  NON-LINEAR 
OPTICAL  PROPERTIES  OF  POLYMERIC  MATERIALS 

Douglas  J.  Miller,  PhD 


INTRODUCTION 

Conjugated  polymers  are  important  for  their  applications  in  electronic 
technologies.1  The  band-gap  is  a  key  material  property  for  (semi) conductive 
and  nonlinear  optical  properties,  and  much  of  the  current  focus  is  on 
designing  new  polymers  having  small  band  gaps.2  Typical  undoped  conjugated 
polymers  have  band  gaps  in  the  1.5  -  3.5  eV  range,  so  polymers  possessing 
values  less  than  or  equal  to  1.0  eV  represent  important  candidate  materials. 

We  recently  proposed3  a  scheme  whereby  low  band  gap  materials  could  be 
designed.  The  underlying  principle  is  that  of  symmetric  ground  states. 
Briefly,  these  are  polymers  in  which  two  equivalent,  isoenergetic , 
isostructural  resonance  forms  contribute  equally  to  the  real  structure. 
Efforts  begun  during  the  1996  Summer  Research  Program  extended  this  work  to 
systems  having  an  all -carbon  backbone,  a  negative  charge,  and  a  very  rigid 
structure.  The  current  research  extended  this  work  to  a  system  which 
contained  both  the  squarate  group  and  five  membered  rings  which  incorporated 
nitrogen  atoms.  The  squarate  group  has  been  in  molecules  of  recent  interest 
for  their  nonlinear  optical  properties.4  All  of  the  studied  oligomers  had 
rigid  structures  and  an  overall  neutral  charge. 
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methods  and  discussion 

Electronic  properties  for  the  polder  of  . 
iron,  values  obtained  from  ”ere  extrapolated 

~->  for  oij:  —  and 

copolymer  in  which  A  was  the  i  3  ^  C°ndUCtln9  P°lymer  was  an  ABAB  f 

-  shortest  Oliver  ^  ^  “  B  ~  ““  —ole 
a  mer,  shown  m  Figure  1  ,» 

contain  a  bipyrrole;  but  all  h  esl3nated  P2S,  did  not 

designated  as  Pns  or  PnB,  „here  „  ref„s  °ll9°”«S  “• 

rings,  and  the  s  indicates  that  a  squ  tOCal  °C  Pyrrole 

fc  a  scJuarate  moiety  is  in  m 

oligomers  while  a  B  indicates  that  a  bipyrrole  oc  • 

of  the  molecule  The  ol '  ccuPies  the  central  portion 

The  oligomer  designated  P4B  „h 

generated  by  creating  .  .dimer-  of  the  P2S  «  "  “  *'  “ 

“e  P2S  structure.  pee  ,• 

adding  a  third  P2S  unit  S  ^enerated  by 

*il  Of  the  Oligomers  have  c2h  sy™,etry  and  are  t  ■ 

Pyrrole  rings.  The  two  slightl  terminated  by  single 

th  structurally  different  series  of  ,  ■ 

thus  generated  (P2S,  P6S  pine  oligomers 

'  S'  •••  and  P4B,  P8B,  P12B  .  . 

infinite  limit,  represent  the  "  ld'  in  the 

present  the  same  polymer  tv,o  ,  . 

-^ting  double  bonds,  possess  rigid  pia„ar  stllT  a^  ‘  " 

electrically  neutral.  Bach  mer  unit  .yrrole-sguarate-p^ole, 

a  formal  -1  charoe  rm  ^  X  '  h  wever-  has 

charge  on  an  oxygen  of  the  sguar.te  and  a  forma, 

the  nitrogen  atom  of  one  of  the  **  Char9s  °n 

Of  the  pyrrole  rings. 

Quantum  chemistry  software  was  fir<1(.  .  , 

“SinS  -"-"P-ical  (M1)  parameters  ^  ^ 

Rough  z-matrices  were 

for  geometry  optimizations  in  the  r  35  lnpUt 

s  ln  hue  Gaussian  94W 

The  geometries  were  the  e  ,  (cunning  on  a  Pc)  . 

'  then  fully  optimized  within  c 

C2h  symmetry  at  the  AMI 
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theory  level  using  the  GAMESS  program.  The  final  geometries  were  then  used 
as  structural  input  for  electronic  calculations  using  GAMESS  or 
spectroscopic  calculations  using  HyperChem. 

The  band-gap  for  the  polymer  was  determined  by  extrapolating,  to  an 
infinite  length,  the  band-gap  values  obtained  for  the  oligomers.  The  band- 
gap  for  each  oligomer  was  calculated  by  finding  the  difference  between  the 
energy  of  the  ground  state  (singlet)  and  the  lowest  lying  excited  state 
(triplet) .  These  energies  were  determined  by  Hartree-Foch  methods  in 
GAMESS.  The  singlet  states  were  calculated  at  the  RHF/6-31G*  level  while 
the  triplet  states  were  calculated  at  the  ROHF/6-31G*  level.  All 
calculations  were  performed  at  the  ground  state  geometry  obtained  by  the  AMI 
optimizations  described  above.  We  noted  that  the  second  order  SCF  (SOSCF) 
energies  did  not  always  converge  correctly,  so  all  results  reported  were 
obtained  with  SOSCF  turned  off. 

Vertical  electron  affinities  for  the  oligomers  were  determined  as  the 
difference  in  energy  between  the  ground  state  (singlet)  and  the  monoanion 
(doublet)  obtained  by  adding  an  electron  to  the  neutral  oligomers.  The 
monoanion  (doublet)  energy  was  calculated  at  the  ROHF/6-31G*  level  of 
theory.  This  was  done  in  order  to  estimate  the  stability  of  the  neutral 
species  in  the  real  world. 

The  band-gap  for  the  polymer  was  also  obtained  by  finding  the  energy 
difference  between  the  the  ground  state  (singlet)  and  first  excited  state 
(also  singlet  in  this  case)  for  each  oligomer  and  extrapolating  the  results 
to  infinite  chain  length.  These  energies  were  obtained  by  the  ZINDO/S 
method  as  implemented  in  HyperChem  (Release  5.01  for  Windows95)  .  Each  Cl 
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calculation  incorporated  the  15  highest  energy,  occupied  orbitals  and  the  15 
lowest  unoccupied  orbitals  (a  15x15  Cl)  .  The  SCF  Convergence  Limit  was  set 
to  0.0001,  Accelerate  Convergence  was  set  to  "off",  and  the  Pi -Pi  overlap 
was  changed  from  the  default  to  0.585  as  recommended  for  organic  materials.8 
All  calculations  employed  the  AMI  geometries  obtained  above.  These 
HyperChem  calculations  were  carried  out  to  compare  to  the  results  obtained 
from  GAMESS  and  to  give  impressions  of  trends  if  GAMESS  results  were  not 
obtainable . 

RESULTS  AND  CONCLUSION 

The  energies,  in  Hartrees  as  determined  in  GAMESS,  of  the  ground  states 
(singlets)  and  the  excited  states  (triplets)  of  various  oligomers  are  listed 
in  Table  1.  Ground  state  energies  were  determined  for  all  oligomers  up 
through  P12B.  Determination  of  the  excited  states  became  problematic  for 
P10S  and  P12B ,  and  efforts  are  continuing. 

Band-gap  values  for  the  oligomers,  as  calculated  from  both  the  GAMESS 
results  and  the  HyperChem  ZINDO/S  computations,  are  listed  in  Table  2. 
Figure  3  illustrates  the  four  band-gap  data  points  (connected  using  a  cubic 
B-spline  curve)  obtained  from  GAMESS.  The  x-axis  units  are  1/n  where  n  is 
the  number  of  pyrrole  rings  in  the  oligomer.  The  graph  shows  that  the  value 
of  the  band-gap  decreases  with  increasing  length  (smaller  1/n  values)  of  the 
oligomer.  This  graph  suggests  that  the  band-gap  for  the  polymer  (1/n  near 
zero)  is  below  0.60  eVs .  More  data  points  are  required  to  more  clearly 
establish  the  trend,  especially  since  the  PnS  and  PnB  series  will  likely 
take  somewhat  different  paths  to  the  same  ultimate  band-gap  value  for  the 
polymer.  Connecting  the  band-gap  values  for  P4B  and  P8B  with  a  straight 
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line  (see  Figure  3)  leads  to  a  band-gap  of  0.488  eVs  at  infinite  length.  A 
similar  line  drawn  through  the  band-gap  values  for  P2S  and  P6S  leads  to  a 
band-gap  value  of  0.531  eVs  at  infinite  length.  Figure  5  illustrates  the 
relationship  of  band-gap  values,  as  determined  by  HyperChem,  for  all  six 
oligomers  (P2S,  P4B ,  P6S ,  P8B,  P10S,  and  P12B) .  Again  the  connected  points 
appear  to  be  directed  toward  the  neighborhood  of  0.60  eVs .  A  "best" 
straight  line  drawn  through  these  data  points,  minus  the  one  for  P2S,  leads 
to  a  band-gap  of  0.552  eVs  at  infinite  length  (see  Figure  6).  The  band-gap 
value  for  P2S  was  left  out  of  Figure  6  since  P2S  is  the  most  likely  oligomer 
to  exhibit  the  effects  of  limited  length  and  the  effects  due  to  being  end- 
capped  by  single  pyrroles. 

All  of  the  results  point  toward  a  band-gap  value  of  less  than  0.60  eVs 
for  the  candidate  polymer.  This  could  indicate  that  it  would  be  worthwhile 
to  attempt  to  synthesize  the  material  in  order  to  study  its  conductive  and 
non-linear  optical  properties.  The  electron  affinities  (Table  3)  point  to  a 
potential  instability,  relative  to  reduction,  of  this  polymer  in  the  real 
world.  Polymers  with  similar  structures  that  have  other  substituents 
attached  would  also  be  worthy  candidates  for  computational  investigation. 

DJM : 1 jm 


18-7 


REFERENCES 


(1)  Gorman,  C.  B.;  Grubbs,  R.  H.  In  Conjugated  Polymers;  Bredas,  J.  L.  ; 
Silbey,  R.,  Eds.;  Kluwer  Academic  Publishers:  Boston,  1991;  pp.  1-48. 

(2)  Kertesz ,  M. ;  Ashertehrani  A.  Macromolecules  1996,  29(3),  940-945. 

(3)  Unpublished  Results. 

(4)  Dirk,  C.  W.;  Herndon,  W.  C.;  Cervantes -Lee,  F. ;  Seinau,  H. ;  Martinez, 
S.;  Kalamegham,  P.;  Tan,  A.;  Campos,  G. ;  Velez,  M. ;  Zyss,  J. ;  Ledoux, 

I. ;  Cheng,  L.  J.  Am.  Chem.  Soc.  1995,  11 7(8),  2214-2225. 

(5)  Gaussian  92,  Revision  A,  M.  J.  Frisch,  G.  W.  Trucks,  M.  Head-Gordon, 

P.  M.  W.  Gill,  M.  W.  Wong,  J.  B.  Foresman,  B.  G.  Johnson,  H.  B. 

Schlegel,  M.  A.  Robb,  E.  S.  Replogle,  R.  Gomperts,  J.  L.  Andres,  K. 

Raghavachari ,  J.  S.  Binkley,  C.  Gonzalez,  R.  L.  Martin,  D.  J.  Fox,  D. 

J.  Defrees,  J.  Baker,  J.  J.  P.  Stewart,  and  J.  A.  Pople,  Gaussian, 
Inc.,  Pittsburgh  PA,  1992. 

(6)  Gaussian  94,  Revision  A,  M.  J.  Frisch,  G.  W.  Trucks,  H.  B.  Schlegel, 

P.  M.  W.  Gill,  B.  G.  Johnson,  M.  A.  Robb,  J.  R.  Cheeseman,  T.  A. 

Keith,  G.  A.  Petersson,  J.  A.  Montgomery,  K.  Raghavachari,  M.  A.  Al- 
Laham,  V.  G.  Zakrzewski,  J.  V.  Ortiz,  J.  B.  Foresman,  J.  Cioslowski, 
B.  B.  Stefanov,  A.  Nanayakkara,  M.  Challacombe,  C.  Y.  Peng,  P.  Y. 

Ayala,  w.  Chen,  M.  W.  Wong,  J.  L.  Andres,  E.  S.  Replogle,  R.  Gomperts, 
R.  L.  Martin,  D.  J.  Fox,  J.  S.  Binkley,  D.  J.  Defrees,  J.  Baker,  J.  P. 
Stewart,  M.  Head-Gordon,  C.  Gonzalez,  and  J.  A.  Pople,  Gaussian,  Inc., 
Pittsburgh  PA,  1995. 

(7)  Schmidt,  M.  W.;  Baldridge,  K.  K.  ;  Boatz,  J.  A.;  Elbert,  S.  T.  ;  Gordon, 

M.  S.;  Jensen,  J.  H. ;  Koseki,  S. ;  Matsunaga,  N. ;  Nguyen,  K.  A.;  Su,  S. 
J.;  Windus,  T.  L.;  Dupuis,  M.  ;  Montgomery,  J.  A.  J.  Comput.  Chem. 

1993,  14,  1347-1363. 

(8)  Ridley,  J.  E.;  Zemer,  M.  C.  Theoret.  Chim.  Acta  1976,  42,  223-236. 


18-8 


Table  Is  Energies  (in  Hartrees)  as  Determined  by  GAMESS 


Oligomer 

Designation  Ground  State  (Singlet) 
P2S  -717.6080634653 

P4B  -1434.0716942861 

P6S  -2150.5352758266 

P8B  -2866.9987431638 

P10S  -3583.4620287117 

P12B  -4299.9245800857 


Excited  State  (Triplet) 
-717.5782832373 
-1434.0451194595 
-2150.5123118523 
-2866.9764854880 
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Table  2:  Band-gap  Energies  (in  eVs) 


Oligomer 

Designation  From  GAMESS  From  HyperChem 


P2S 

0.8103 

2.241 

P4B 

0.7231 

1.569 

P6S 

0.6248 

1.250 

P8B 

0.6056 

1.072 

P10S 

- 

0.958 

P12B 

_ 

0.885 
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Figure  4,  GAM  ESS  Results  for  P4B  and  P8B 
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1/n,  n  =  number  of  pyrrole  units 
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1/n,  n  =  number  of  pyrrole  units 


1-6  h  Figure  6,  HyperChem  Results  for  All  Except  P2S 
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Table 


Electron  Affinities  from  GAMESS  (in  eVs) 


Oligomer 


Designation 

Electron  Affinities 

P2S 

0 . 814 

P4B 

1-294 

P6S 

1.927 

P8B 

1.914 

PI  OS 

_ 

P12B 
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FINAL  REPORT:  CLASSIFICATION  AND  VISUALIZATION  OF 
TISSUE  IN  MULTIPLE  MODALITIES  OF  BRAIN  MR 

Timothy  S.  Newman,  Ph.D. 

Department  of  Computer  Science 
University  of  Alabama  in  Huntsville 
Huntsville,  AL  35899 

Abstract 

In  this  project,  tissue  classification  and  visualization  in  multiple  modalities  of  magnetic  resonance  (MR)  data  of 
the  brain  was  investigated.  The  project  focused  on  enhancement  of  tissue  segmentation  and  classification  methods  and 
exploration  of  effective  visualization  of  anatomical  structures.  Much  of  the  effort  examined  extracting  and  rendering 
ventricular  and  vascular  structures.  A  number  of  visualization  techniques  were  developed  and  applied  to  MR  brain 
datasets.  The  techniques  are  described  and  sample  final  visualizations  are  exhibited  in  this  report. 
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1  Introduction 

In  this  report,  we  describe  our  efforts  to  effectively  visualize  the  brain’s  ventricular  and  vascular  systems  in  multiple 
modalities  of  magnetic  resonance  imaging  (MRI)  data.  These  efforts  are  a  continuation  of  the  work  that  began  at  Wright 
Labs  in  Summer  1996  in  collaboration  with  James  Leonard.  The  investigation  of  visualization  of  the  ventricular  system 
involved  four  modalities  of  MR  data,  including  magnetic  resonance  angiography  (MRA),  Tl-weighted,  T2-weighted, 
and  proton  density  datasets.  Visualization  of  the  brain’s  vascular  system  was  investigated  using  MRA  data. 

This  report  is  organized  as  follows.  First,  the  problem  that  was  investigated  is  motivated.  Second,  some  of  the 
related  projects  from  the  literature  are  described.  Next,  an  overview  of  the  work  that  was  conducted  at  Wright  Lab 
in  1996  is  presented  and  finally  the  extension  of  that  work  in  this  extension  project  is  described  and  some  results  are 
exhibited. 


2  Motivation  and  Background 

The  motivating  factors  for  this  research  included  enhancing  capabilities  to  exploit  pre-operative  (and  possibly  intra¬ 
operative)  image  data  in  surgical  planning  and  guidance.  Providing  anatomical  and  functional  information  to  clinical 
staff  can  also  aid  in  diagnostic  assessment  and  treatment  monitoring.  A  number  of  papers  in  the  literature  have 
described  the  value  of  high-resolution  high  level  information  (for  example,  visualization  of  the  3D  boundaries  of  all 
relevant  structures)  in  a  region  of  interest  for  surgery  planning.  For  example,  the  reader  may  wish  to  consider  the 
recent  papers  [1,  30,  31].  For  surgical  and  radiotherapy  interventions  deep  in  the  brain,  it  is  especially  useful  for  the 
locations  of  cortical  structures  and  blood  vessels  to  be  well-known  [19,  30], 

The  goals  of  this  research  project  were  to  study  methods  to  enhance  understanding  of  locations  of  various  tissue 
and  structures  within  the  brain.  The  work  has  focused  primarily  on  improving  the  segmentation  of  the  brain’s 
ventricles  and  the  presentation  of  meaningful  and  efficient  visualizations  of  the  vascular  system.  The  ventricular  and 
vascular  systems  of  the  brain  are  two  structures  whose  accurate  visualization  can  aid  greatly  in  diagnostic  assessment, 
intervention  planning  and  guidance,  and  evaluation  of  treatment. 

Issues  in  the  registration  of  datasets  have  previously  been  studied  by  several  research  teams.  Several  techniques 
have  been  presented  for  dataset  registration.  The  traditional  registration  method  involves  the  use  of  a  stereotactic  frame 
rigidly  attached  to  a  patient.  Some  of  the  projects  reported  in  the  literature  have  used  external  fiducial  markers  that  are 
visible  in  the  acquired  data  for  registration.  For  example,  Adams  et  al.  [1],  the  Cleveland  Clinic  group  [3],  and  Malison 
et  al.  [22]  have  used  external  markers  to  aid  in  registration.  Frameless  surgery  is  also  possible.  For  example,  Peters 
and  colleagues  [30]  at  the  Montreal  Neurological  Institute  have  registered  datasets  by  manually  choosing  landmark 
points  on  the  surface  of  the  head.  They  calculated  the  transformation  that  aligns  the  data  points  using  least  squares. 
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Hemler  et  al.  [14]  have  registered  MR  and  CT  datasets  by  matching  polygonized  boundaries  of  the  head.  A  number 
of  researchers  have  attempted  to  register  patient  datasets  with  brain  atlas  data  using  deformation-based  techniques. 
For  example,  Christensen  at  al.  [5,  6]  have  used  models  of  elastic  solids  and  viscous  fluids  as  motivation  for  their 
deformation  method. 

Methods  to  segment  various  structures  of  interest  in  tomographic  data  have  been  investigated  by  many  research 
groups.  Fishman  et  al.  [8]  have  outlined  the  general  classes  of  segmentation  approaches  in  the  medical  imaging 
literature.  Some  of  the  segmentation  methods  for  structures  in  the  brain  include  the  methods  of  Li  et  al.  [21]  who 
segment  major  tissue  classes  (white  matter,  gray  matter,  cerebro-spinal  fluid,  and  potential  tumors)  in  MR  datasets  using 
symmetry,  morphological  operations,  and  measures  of  segmented  region  size.  Arata  et  al.  [2]  have  used  histogramming 
and  K-means  clustering  to  segment  ventricles,  sulci,  and  white  matter  lesions  from  proton  density  and  T2- weighted 
data.  Kapouleas  [17]  has  extracted  the  outline  of  the  brain  and  suspected  Multiple  Sclerosis  lesions  using  local  edge 
detection  in  proton  density  and  T2-weighted  data. 

3  Overview  of  Efforts  in  Summer  1996 

In  the  1996  Summer  Faculty  Research  Program,  extraction  of  certain  anatomical  structures  in  head  MR  images  was 
investigated.  The  goal  of  that  effort  was  to  first  determine  anatomical  landmarks  that  were  appropriate  to  allow  the 
registration  of  multiple  modalities  of  MR.  Registration  (that  is,  alignment)  of  MR  datasets  offers  the  potential  of 
exploiting  the  capabilities  of  each  modality  in  an  integrated  fashion.  (For  example,  the  vasculature  tends  to  have  the 
highest  contrast  with  other  tissue  in  MRA  data,  making  MRA  the  best  MR  modality  for  studying  the  vasculature.  In 
T2- weighted  images,  edema  caused  by  tumor  cell  invasions  is  highly  contrasted  with  other  tissue  [19],  If  the  relative 
orientation  of  the  subject  in  each  dataset  is  known,  the  ensemble  of  data  can  allow  a  wide  variety  of  useful  structures’ 
relative  positions  to  be  better  understood.)  Several  landmarks  seemed  to  be  visually  apparent  in  the  four  modalities  of 
MR  data  available  for  the  project.  These  included  the  eyes,  the  lateral  ventricles,  and  the  longitudinal  fissure  of  the 
brain.  These  landmarks  could  be  used  to  compute  the  transformations  that  align  the  datasets. 

After  the  appropriateness  of  these  anatomical  landmarks  was  determined,  techniques  were  developed  and/or 
applied  to  the  task  of  determining  landmark  location  in  each  image  modality.  The  techniques  will  be  briefly  described 
here.  A  more  complete  description  is  available  in  the  1996  program  final  report  [24]  and  in  a  1997  conference 
paper  [27], 

An  algorithm  that  the  PI  had  previously  developed  for  extraction  of  the  kidney,  kidney  collecting  system,  kidney 
cysts,  and  other  nearby  organs  in  lower  torso  CT  data  [29]  was  applied  to  the  problem  of  segmenting  the  position 
of  the  lateral  ventricles  in  the  four  modalities  of  MR  data.  The  method  requires  a  human  to  select  an  initial  seed  in 
the  structure  of  interest  and  uses  multiple  stages  of  volume-growing,  morphological  refinement,  and  locally  adaptive 
histogramming  to  achieve  a  final  segmentation. 

For  extraction  of  the  longitudinal  fissure  of  the  brain,  the  method  of  Brummer  [4]  was  applied.  Brummer’s 
method  assumes  that  the  longitudinal  fissure  can  be  well-modeled  by  a  plane,  and  he  proposed  an  efficient  method  to 
automatically  determine  that  plane  in  MR  data.  During  the  1996  summer  program,  we  began  implementation  of  the 
Brummer  method,  but  were  unable  to  complete  its  implementation  before  the  end  of  the  program. 
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Most  of  the  effort  during  the  summer  involved  development  of  a  new  algorithm  to  automatically  locate  the  eyes 
in  the  four  modalities  of  MR  data  available.  The  method  developed  exploits  the  spherical  shape  of  the  eye  to  achieve 
segmentation  of  the  eyes.  The  method  uses  the  Hough  Transform  circle  detection  scheme  [20]  to  find  circular  regions 
in  edge  images  formed  from  each  slice.  The  set  of  circles  is  then  clustered  into  sets.  Finally,  the  method  of  Hall  et 
al.  [12]  is  used  to  determine  the  best-fitting  sphere  to  each  set,  and  the  two  best-fitting  spheres  which  have  radii  within 
the  allowable  range  for  eyes  are  accepted  as  the  position  of  the  eyes.  A  detailed  description  of  the  eye  extraction 
technique  is  available  in  [24]  and  [27]. 

4  Extension  Program  Effort 

In  the  followon  project  to  the  1996  summer  effort,  we  improved  the  segmentation  of  the  ventricular  system  and 
developed  methods  to  improve  classification  of  tissue  representing  blood  vessels.  In  addition,  we  developed  methods 
to  improve  final  rendering  and  visualization  of  these  structures.  In  this  section,  the  methods  will  be  briefly  described 
and  some  results  will  be  exhibited. 

4.1  Segmentation  and  Visualization  of  Vasculature 

The  methods  for  segmenting  and  visualizing  vasculature  will  be  discussed  first.  The  effort  to  localize  vasculature  and 
to  present  an  effective  visualization  began  with  an  investigation  of  the  maximal  intensity  projection. 

4.1.1  MIP 

Maximal  intensity  projection  (MIP)  is  one  method  to  generate  a  direct  volume  rendering.  Direct  volume  rendering 
can  be  contrasted  with  the  traditional  method  of  graphical  rendering,  the  surface-based  rendering.  In  surface-based 
rendering,  the  output  image  is  constructed  by  applying  a  physically-motivated  lighting  model  to  surfaces  in  a  scene. 
Volume-based  rendering  does  not  rely  on  the  explicit  determination  of  surface  features  and  subsequent  application  of 
a  lighting  model  to  the  surface  features. 

The  MIP  is  a  two  dimensional  representation  of  a  three  (or  higher)  dimensional  dataset.  Each  point  Pi  in  the  MIP 
is  formed  from  casting  a  ray  that  originates  at  the  point  and  passes  through  the  volume.  The  intensity,  Iiy  of  Pj  is  set 
to  the  maximal  intensity  value  of  all  the  voxels  intersected  by  the  ray  as  it  passes  through  the  volume. 

One  example  maximal  intensity  projection  image  from  a  brain  MRA  dataset  is  shown  in  Figure  1.  In  this  image, 
the  rays  were  projected  perpendicular  to  the  image  plane  and  along  one  of  the  axes  of  the  grid  of  MRA  data.  Because 
flow  is  captured  as  high  intensity  voxels  in  MRA  data,  blood  flow  tends  to  be  one  of  the  highest  contrast  tissues  in 
MRA.  The  whitish  trails  in  the  image  are  the  blood  vessels. 

MIP  images  typically  have  an  X-ray-like  appearance.  While  these  images  allow  an  observer  to  discern  the  general 
pattern  of  the  blood  vessels  in  the  volume,  the  relative  three-dimensional  locations  of  the  blood  vessels  is  not  apparent. 
However,  we  have  experimented  with  two  techniques  that  allow  a  viewer  to  have  an  improved  perception  of  the  relative 
locations  of  the  blood  vessels. 

The  first  technique  involves  the  generation  of  a  series  of  MIP  images  from  an  image  plane  which  is  rotated  in 
a  circular  orbit  around  the  central  axis  of  the  volume.  By  performing  MIP  at  regular  intervals  in  this  orbit  and  then 
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Figure  1 :  Sample  Maximal  Intensity  Projection  on  brain  MRA  dataset. 


“playing”  the  sequence  of  images  as  a  cinematic  loop,  a  viewer  perceives  the  position  and  depth  of  many  of  the 
major  blood  vessels.  We  have  placed  one  example  animative  rendering  on  the  web.  It  can  be  viewed  at  the  URL 
http://www.cs.uah.edu/visgig/K12/images/animation.gif. 


Figure  2:  3D  rendering  of  volume  of  MIP  points. 

The  second  visualization  technique  involves  generating  a  3D  volume  using  the  MIP  image.  If  the  position  of  the 
maximal-valued  voxel  for  each  ray  is  recorded,  a  three-dimensional  representation  of  the  blood  vessels  can  be  created. 
Standard  graphics  depth-cueing  can  be  applied  to  the  realization  of  these  points  to  allow  a  user  to  perceive  the  position 
of  the  points.  In  addition,  the  points  can  be  viewed  in  three  space  using  the  3D  rendering  capabilities  of  a  visualization 
package.  An  example  visualization  is  shown  in  Figure  2.  The  MIP  volume  is  shown  in  the  middle  of  the  image  and 
a  projection  of  that  data  to  a  bounding  plane  is  shown  in  the  lower  left  comer  of  the  image.  The  resultant  image  can 
also  be  viewed  using  a  rainbow  colormap  to  display  the  depths.  A  black  and  white  rendering  of  the  color  image  is 
shown  in  Figure  3.  (The  color  image  reveals  much  more  information,  but  reproduction  limitations  restrict  us  to  black 
and  white  copy  in  this  report.)  It  is  possible  for  clinical  staff  to  exploratively  interact  with  this  data;  the  user  can  rotate 
and  translate  the  data  to  view  it  from  different  directions  and  to  thereby  understand  the  spatial  relationships  of  the 
structures. 

Two  sample  visualizations  of  the  vasculature  are  shown  in  Figures  4  and  5.  The  visualizations  were  computed  on 
the  MRA  dataset  of  a  healthy  patient.  The  images  displayed  in  these  figures  are  black  and  white  representations  of 
color  images  that  were  synthesized  in  the  research  project.  These  visualizations  show  maximal  intensity  projections 
from  three  orthogonal  directions  that  are  projected  onto  the  sides  of  the  cube  which  bounds  the  data.  In  addition,  a 
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Figure  3:  Black  and  white  rendering  of  depth-colorized  MIP. 


fourth  volumetric  projection  from  an  oblique  direction  is  shown  as  a  shadow  below  the  cube. 


Figure  4:  Visualization  showing  3  MIPs  projected 
to  bounding  cube. 


Figure  5:  Second  visualization  showing  3  MIPS 
projected  to  bounding  cube. 


We  are  currently  investigating  using  a  collection  of  MIPs  for  visualization.  Those  MIPs  would  be  collected  from 
viewpoints  that  are  approximately  equally  distributed  on  the  surface  of  a  sphere  that  contains  the  volume  and  whose 
centroid  is  coincident  with  the  centroid  of  the  volume.  We  are  investigating  if  a  collection  of  many  MIPs  in  MRA  data 
begins  to  approach  the  appearance  of  an  isosurface  rendering  on  the  data.  We  have  collected  isosurface  renderings 
with  fixed  thresholds  on  the  MRA  data,  but  the  isosurface  renderings  are  not  reproduced  here.  We  want  to  determine 
how  many  MIP  viewpoints  are  necessary  to  reasonably  reproduce  the  position  of  nearly  all  of  the  vasculature  and  to 
determine  how  many  viewpoints  are  needed  before  that  data  approaches  the  set  of  isosurface  data. 

The  advantage  of  MIP  over  isosurface  extraction  is  that  the  MIP  method  does  not  require  a  user  to  establish  a 
threshold  level,  thus  MIP  may  have  the  potential  for  more  automatic  extraction  and  visualization  without  the  need  to 
carefully  select  threshold  parameters.  We  are  continuing  to  investigate  this  potential. 
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4.1.2  Linking  in  Projection  Images  I  -  2D  distance 


The  first  attempt  at  improving  the  MIP  data  to  allow  better  visualization  of  vasculature  and  in  turn  more  useful  clinical 
application  involved  attempting  to  link  some  of  the  blood  vessels  that  were  disconnected  in  binarizations  of  MIP 
images.  Due  to  partial  volume  effects  and  some  in-plane  flow,  vasculature  in  the  MRA  data  is  not  always  uniformly 
bright.  As  a  result,  there  tends  to  be  no  fixed  threshold  that  can  satisfactorily  reveal  only  vasculature,  whether  that 
threshold  be  applied  as  the  isosurface  level  or  as  a  threshold  on  the  result  of  maxima]  intensity  projection  images. 
Consequently,  vasculature  tends  to  be  disconnected  in  any  2D  or  3D  renderings  of  MRA  data. 

To  connect  the  disconnected  vessels  in  the  processed  data,  we  first  used  a  linkage  technique  based  on  the  well- 
known  distance  transform.  Two  other  techniques  for  linkage  were  considered  but  only  cursorily  explored.  These 
included  morphological  closing  and  volume  growing  in  T1  data  using  seed  points  from  the  highest  intensity  voxels 
in  the  MIP  on  the  MRA  data.  Morphological  closing  tended  to  produce  unwanted  effects  as  it  closed  gaps  between 
vessels,  although  possibly  subsequent  processing  could  be  developed  to  overcome  such  artifacts.  Volume-growing  in 
T1  data  was  not  attempted  because  the  amount  of  vasculature  that  is  visually  distinguishable  in  T1 -weighted  MR  data 
appeared  to  be  less  than  that  of  MRA;  we  believe  that  fewer  vessels  would  be  extracted  in  Tl.  However,  it  might  be 
possible  to  improve  the  segmentation  of  a  few  key  vessels  by  coupling  Tl  region-growing  of  blood  vessels  with  the 
MRA-based  methods. 

Our  linkage  method  used  both  pixel  distance  and  intensity.  The  standard  Distance  Transform  links  segments  based 
only  on  their  distance.  Thus,  linear  segments  tend  to  result  from  Distance  Transform-based  linkage.  By  using  the 
intensity  of  the  pixels  in  the  original  MIP  data,  the  path  of  linkage  can  be  influenced  when  there  is  intensity  evidence 
indicating  the  potential  of  a  blood  vessel  within  tissue.  Our  method  weights  both  the  distance  and  intensity  to  do  the 
linkage. 

The  linking  method  first  identifies  isolated  components.  Components  consisting  of  a  single  pixel  are  removed 
from  consideration  to  avoid  linking  with  isolated  noise  points.  Next,  for  each  component,  the  lowest  cost  path  to  any 
nearby  component  is  found.  Our  method  allows  paths  of  distance  at  most  7  pixels. 

Improved  perception  of  vasculature  is  possible  if  the  linked  pixels  are  added  to  volumetric  representations  computed 
from  single  or  multiple-direction  MIP  data. 

Examples  of  the  MIP  projections  from  two  directions  are  shown  in  Figures  6  (a)  and  (b).  A  binarizing  thresholding 
of  these  projections  are  shown  in  (c)  and  (d).  The  bottom  two  images,  Figures  6(e)  and  (f),  show  the  improvement 
from  applying  the  Distance  Transform-based  edge  linkage.  The  number  of  isolated  vessel  regions  is  reduced  in  the 
edge-linked  images. 

4.1.3  Linking  in  Projection  Images  II  -  3D  distance 

A  three-dimensional  linking  algorithm  based  on  a  3D  distance  transform  was  also  investigated.  The  algorithm  is  a 
three  dimensional  analogue  of  the  essentially  2D  linking  in  individual  MIPs.  The  paths  of  linkage  are  computed  in 
the  volume  of  data  that  is  formed  from  the  sparse  3D  representation  of  a  single  (or  of  multiple)  MIPs.  That  volume 
contains  data  only  for  voxels  that  were  a  maximal  point  on  one  of  the  projection  rays  of  one  (or  more)  MIPs.  By 
linking  disconnected  components  in  such  data,  a  true  three  dimensional  representation  of  the  vasculature  can  be  found. 

The  method  requires  large  memory  spaces  to  execute  fairly  efficiently. 
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(e)DTw/Int.,30d.  (f)  DT  w/ Int.,  60  d. 

Figure  6:  MIP  Projections  and  Vascular  Linkage. 


In  our  experiments,  we  have  thresholded  the  output  from  the  MIP;  low-intensity  points  from  MIP  are  rejected 
before  computing  the  volume  linking. 

Experiments  are  currently  being  conducted  to  ascertain  the  impact  of  the  number  of  MIP  projections  used  to 
form  the  initial  volume.  All  of  the  MIP  linkage  experiments  aim  to  overcome  the  limitations  caused  by  selection  of 
a  threshold  that  removes  much  of  the  background  “noise”  found  by  the  MIP  method  while  also  removing  some  of 
the  vasculature.  The  exhibited  images  demonstrate  the  improvement  in  segmentation  and  visualization  of  vasculature 
using  the  linkage. 

4.1.4  Linking  Using  Probabilistic  Relaxation 

The  PI  has  recently  developed  a  technique  for  segmentation  of  vehicles  in  synthetic  aperture  radar  (SAR)  [26].  That 
method  involves  a  histogram-based  initial  segmentation  followed  by  a  series  of  refinement  stages  that  link  disconnected 
regions  in  an  image.  We  attempted  to  apply  this  method  to  the  individual  2D  MIP  data,  both  for  segmentation  and  then 
for  linking  of  the  vessel  segments.  However,  the  initial  segmentations  were  not  good,  and  the  final  linkings  seemed  to 
be  inferior  to  the  other  methods  we  tested. 

Full  details  of  the  method  were  presented  in  [26].  The  method  computes  a  histogram  for  a  slice  image  and 
attempts  to  find  the  division  between  the  first  and  second  mode.  Hierarchical  clustering  is  then  used  to  group  pixels 
that  are  members  of  the  second  mode.  The  clustering  uses  a  weighting  based  on  both  pixel  position  and  intensity.  A 
relabeling  step  motivated  by  the  probabilistic  relaxation  labeling  is  used  to  create  a  final  segmentation.  The  relabeling 
tends  to  link  disconnected  components. 

Additional  investigation  is  required  to  be  able  to  successfully  apply  this  method  to  the  vascular  segmentation 
problem  in  MRA  data.  The  MRA  data  tends  to  have  three  modes,  including  the  background  around  the  head,  skin  and 
fatty  tissue,  and  everything  else.  The  third  mode  is  the  one  that  includes  the  blood  vessels,  although  the  vessels  are  far 
out  in  the  tail  of  the  mode.  Thus,  histogram-based  segmentations  of  vasculature  in  the  MIP  data  may  be  difficult. 
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4.2  Ventricle  Segmentation  and  Visualization 

In  the  follow-on  effort,  the  segmentation  method  used  to  segment  the  brain’s  ventricles  was  refined.  The  technique 
produces  excellent  segmentations  of  the  lateral  ventricles,  although  the  smaller  third  and  fourth  ventricles  are  more 
difficult  to  segment.  A  journal  paper  describing  the  method  has  been  submitted  recently  [28]. 

The  method  was  originally  applied  to  the  segmentation  of  the  kidneys,  kidney  collecting  system,  cysts  and  tumors 
on  and  adjacent  to  ihe  kidneys,  and  other  organs  in  lower  torso  CT  data. 

There  are  two  major  stages  to  the  method.  The  first  stage  involves  volume-growing  from  a  human-supplied  seed. 
Thus,  the  users  of  the  method  chooses  a  voxel  that  is  within  a  lateral  ventricle.  A  volume  is  grown  out  from  the  seed  point 
using  connectedness  and  prior  knowledge  about  intensity  distribution  of  CSF  in  MR  data.  The  first  stage  produces 
a  coarse  segmentation  of  the  lateral  ventricles.  The  second  stage  of  processing  entails  a  morphological  operation 
followed  by  locally  adaptive  histogramming.  The  locally  adaptive  histogramming  refines  the  initial  segmentation.  A 
histogram  is  computed  in  a  region  of  each  slice  about  the  original  segmented  volume.  This  second  stage  of  processing 
was  the  major  refinement  of  the  segmentation  method  during  the  extension  project. 

The  second  stage  of  processing  involves  first  dilating  the  volume-grown  structure  based  on  the  size  of  the  structure. 
For  each  slice,  a  histogram  is  computed  for  the  dilated  region.  In  the  slice  S{  of  the  initial  seed,  the  histogram  mode 
Mi  that  contains  the  seed  is  automatically  identified.  Then,  for  the  slices  Si+1  and  S{- 1  adjacent  to  that  slice,  the 
peak  of  mode  is  taken  as  a  point  that  defines  the  mode  of  interest  in  those  slices.  This  process  is  repeated  for 
successive  slices;  the  peak  of  mode  Mj  in  slice  Sj  is  taken  as  a  point  within  the  mode  of  interest,  Mj+1  of  slice  SJ+1 . 
This  process  defines  the  mode  of  interest  in  each  slice,  and  the  pixels  that  fall  within  that  mode  are  taken  as  the  final 
segmentation  of  the  structure  of  interest  within  the  node. 

An  example  showing  an  initial  image  and  the  final  segmentation  of  one  ventricle  is  shown  in  Figures  7  and  8, 
respectively.  Two  shaded  three-dimensional  renderings  of  that  extracted  ventricle  are  shown  in  Figure  9 


Figure  7:  Tl-weighted  slice  containing  ven-  Figure  8:  The  portion  of  extracted  left  lateral 

tnc^es-  ventricle  in  the  same  slice  as  at  left. 
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Figure  9:  Shaded  3D  Renderings  of  Extracted  Ventricle. 


4.3  Miscellaneous  Additional  Action  Items 

The  PI  also  was  able  to  observe  an  image-guided  pedicle  screw  surgery  at  the  Cleveland  Clinic  during  the  project. 
Additionally,  dicom-format  images  from  a  patient  with  artero- venous  malformation  were  obtained  from  the  Cleveland 
Clinic.  These  images  will  be  used  in  future  investigations  and  to  validate  the  work. 

5  Conclusion  and  Future  Work 

This  document  has  presented  a  summary  of  the  research  that  was  conducted  in  our  1996  Summer  Research  Extension 
Program.  Methods  for  the  improved  classification  and  visualization  of  tissue  representing  cerebro  spinal  fluid  (CSF; 
in  particular,  the  ventricular  system)  and  the  vascular  system  of  the  brain  have  been  developed.  These  techniques 
can  be  helpful  for  diagnostic  assessment,  intervention  planning,  and  treatment  assessment.  Clinical  staff  can  better 
understand  relative  anatomical  location  of  structures  of  interest  through  our  techniques. 

5.1  Future  Efforts 

Future  research  efforts  will  extend  the  methods  that  we  have  developed.  We  are  currently  exploring  the  registration  of 
MRA  datasets  collected  at  different  times.  We  are  investigating  methods  to  extract  landmark  points  and  curves  from 
the  vascular  tree  that  are  suitable  for  registration.  In  addition,  we  are  developing  techniques  to  automate  the  selection 
of  the  seed  point  for  the  ventricle  segmentation. 

To  date,  our  work  in  extracting  the  third  and  fourth  ventricles  has  been  cursory.  These  smaller  structures  do  not 
seem  to  be  as  clearly  distinguished  in  all  four  data  modalities  as  do  the  larger  lateral  ventricles.  Future  work  is  needed 
to  more  robustly  classify  those  structures.  In  addition,  only  cursory  attention  was  focused  on  distinguishing  between 
gray  and  white  matter.  Preliminary  work  suggests  that  histogram-based  techniques  in  T2  data  may  allow  coarse 
segmentation  of  these  tissue  types.  Those  coarse  results  are  perhaps  validated  by  the  work  of  Homak  and  Fletcher  [16], 
who  have  previously  reported  success  in  segmenting  gray  and  white  matter  using  multidimensional  histograms. 


5.2  Demonstrable  Achievements 

One  journal  paper  [28]  has  recently  been  submitted  that  is  based  partly  on  the  work  described  in  the  extension 
project.  That  paper  is  currently  under  review.  An  additional  journal  submission  is  planned  to  present  the  method  for 
eye  extraction.  A  conference  paper  will  be  submitted  soon  that  discusses  our  vascular  segmentation  and  visualization 
method.  Finally,  one  conference  paper  [27]  has  been  presented  that  discusses  the  early  stages  of  our  work  in  anatomical 
structure  classification  in  the  brain.  That  paper  seemed  to  have  a  good  audience  reception  at  the  conference. 

The  project  has  produced  some  progress  toward  the  goals  of  allowing  more  clear  visualization  of  anatomical 
structures  in  the  brain.  We  hope  that  future  work  will  be  possible  to  improve  the  methods  further. 
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Abstract 


The  ray  tracing  algorithm,  often  used  in  computer  graphics  for  creating  a  2-D 
picture  of  a  3-D  world,  is  computationally  intensive.  Many  attempts  have  been  made  in 
the  past  to  reduce  the  processing  time  taken  by  this  algorithm.  Related  work  can  be  found 
in  references  [1]  to  [5],  In  this  research  project,  a  systolic  array  architecture  and  its 
functional  modules  are  developed  for  implementing  the  algorithm  on  Field 
Programmable  Gate  Array(s).  Since  FPGA(s)  are  programmable  in  real-time,  this  method 
has  a  definite  advantage  over  conventional  ones.  It  is  hoped  that  the  work  will  lead  to  the 
successful  implementation  of  the  algorithm  on  both  the  CHAMP  and  DRASTIC  field 
programmable  boards.  Since  the  ray  tracing  algorithm  is  also  used  in  the  Xpatch  software 
used  for  accurate  electromagnetic  scattering  predictions,  this  work  will  also  contribute 
toward  reducing  the  processing  time  of  the  Xpatch  ray  tracer. 
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FPGA  Implementation  of  the  XPATCH  Ray  Tracer 

Mohammed  Y.  Niamat 


Introduction:  The  Rav  Tracing  Problem 

For  a  given  starting  point  R(o),  and  a  starting  direction  R(d),  the  ray  tracing  problem  is  to 
determine  if  the  ray  defined  by 

R(t)  =  R(o)  +  R(d)  *  t,  t  >  0 

hits  the  target.  If  it  does,  the  next  question  that  needs  to  be  answered  is  “where  is  the  hit- 
point  ?  ”  To  accomplish  these  before  ray  tracing  a  target,  a  “tree”  of  the  target  is  built. 
The  tree  contains  information  on  how  the  facets  are  distributed  in  the  space  and  is  used 
to  identify  those  facets  that  are  likely  to  be  hit  by  a  ray.  In  xpatchf,  the  tree  is  based  on 
an  algorithm  called  ‘binary  space  partitioning’  (BSP)  . 

In  order  to  speed  the  computations  for  ray-object  intersection,  the  object  is  often  enclosed 
in  a  bounding  volume.  The  bounding  volume  can  be  of  different  shapes.  Common 
bounding  volumes  being  spheres,  and  rectangular  boxes.  Kay  and  Kajiya  [],  used  the 
concept  of  slabs  to  form  rectangular  boxes  as  bounding  volumes.  A  slab  is  simply  the 
space  between  two  parallel  planes.  The  intersection  of  a  set  of  slabs  defines  the  bounding 
volume.  The  method  relies  on  intersection  of  each  pair  of  slabs  by  the  ray,  keeping  track 
of  the  near  and  the  far  intersection  distances.  If  the  largest  near  value  is  greater  than  the 
smallest  far  value,  then  the  ray  misses  the  bounding  volume;  otherwise,  it  hits. 

The  Algorithm 

One  of  the  simplest  finite  bounding  volumes  is  the  intersection  of  two  parallel  planes 
each  aligned  so  that  their  normals  are  in  the  same  direction  as  the  X,  Y,  and  Z  axes.  The 
following  algorithm  determines  whether  the  bounding  box  is  hit  by  the  ray  ,  or  not  (miss). 

Define  the  orthogonal  box  by  two  coordinates: 

box’s  minimum  extent  =  B(0)  =  [BBP(0,0)  BBP(0,1)  BBP(0,2)] 
box’s  maximum  extent  =  B(l)  =  [BBP(1,0)  BBP(1,1)  BBP(1,2)] 

The  origin  and  direction  of  the  ray  is  defined  by: 

R(origin)=  R(0)  =  [rs(0)  rs(l)  rs(2)] 

R(direction)  =  R(d)  =  [rd(0)  rd(l)  rd(2)] 
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The  ray  can  then  be  defined  as  the  set  of  points  on  a  line  described  by  the  following 
equation: 


R(t)  =  R(0)  +  R(d)  *  t,  where  t  >  0 

The  above  equation  can  be  broken  up  into  its  three  axes  components  as  follows: 

Along  X-  Axis:  X(t)  =  rs(0)  +  rd(0)  *  t 

Along  Y-Axis:  Y(t)  =  rs(l)  +  rd(l)  *  t 

Along  Z_  Axis:  Z(t)  =  rs(2)  +  rd(2)  *  t 

In  the  above  equations,  0, 1,  and  2  represent  the  X,  Y,  and  Z  axes,  respectively. 

Since  X(t),  Y(t),  and  Z(t)  satisfy  all  points  on  the  X,  Y,  and  Z  components  of  the  ray,  they 
must  also  satisfy  the  points  on  the  minimum  and  maximum  extents  of  the  box.  Therefore, 

Along  X-  Axis:  BBP(0,0)  =  rs(0)  +  rd(0)  *  t 

BBP(1,0)  =  rs(0)  +  rd(0)  *  t 

Along  Y-  Axis :  BBP(0, 1 )  =  rs(  1 )  +  rd(  1 )  *  t 

BBP(1,1)  =  rs(l)  +  rd(l)  *  t 

Along  Z-axis:  BBP(0,2)  =  rs(2)  +  rd(2)  *  t 

BBP(1,2)  =  rs(2)  +  rd(2)  *  t 

From  the  above  equations,  ‘t’  can  be  computed  for  each  of  the  above  six  equations.  For 
example,  for  the  X-axis: 

tl  =  [BBP(0,0)  -  rs(0)]  /rd(0),  for  the  minimum  extent  of  the  box,  and 
t2  =  [BBP(1,0)  -  rs(0)]  /  rd(0),  for  the  maximum  extent  of  the  box. 

The  ray  algorithm  for  the  intersection  of  the  box  ,  can  be  described  as  follows: 

Set  tnear  =  minus  infinity,  and  tfar  =  plus  infinity  (i.e.,  arbitrarily  large  values). 

For  each  pair  of  planes  associated  with  X,  Y,  and  Z  (described  here  for  the  set  of  X 
planes) : 

If  the  direction  rd(0)  is  equal  to  zero,  then  the  ray  is  parallel  to  the  planes, 
so: 

If  the  origin  is  not  between  the  slabs,  i.e.,  rd(0)  <  BBP(0,0),  or 
rd(0)  >  BBP(1,0),  then  the  ray  misses  the  box. 

Else,  if  the  ray  is  not  parallel  to  the  planes,  then 

begin: 
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Calculate  intersection  distances  of  planes: 

tl  =  [BBP(0,0)-rs(0)]/rd(0) 
t2  =  [BBP(l,0)-rs(0)]/rd(0) 

If  tl  >  t2,  swap  tl  and  t2 
If  tl  >  tnear,  set  tl  =  tnear 
If  t2  <  tfar ,  set  t2  =  tfar. 

If  tnear  >  tfar ,  box  is  missed 

If  tfar  <  0,  box  is  behind  ray,  so  box  is  missed 

end. 

end  of  for  loop. 

If  the  box  is  hit,  the  intersection  distance  is  equal  to  tnear  and  the  ray’s  exit  point  is  tfar. 
Figure  1(a)  illustrates  the  case  where  the  ray  hits  the  box.  Notice  that  tnear  is  less  than 
tfar  for  this  case.  In  Figure  1(b),  tnear  is  greater  than  tfar,  and  therefore  the  ray  misses  the 
box. 

The  various  symbols  used  in  calculating  the  ray/box  intersection  are  tabulated  below. 


Symbols 

Meaning 

BBP(0,  0) 

Minimum  extent  of  the  Bounding  Box  Parent  in  the  x-direction 

BBP(1,  0) 

Maximum  extent  of  the  Bounding  Box  Parent  in  the  x-direction 

BBP(0,  1) 

Minimum  extent  of  the  Bounding  Box  Parent  in  the  y-direction 

BBP(1,  1) 

Maximum  extent  of  the  Bounding  Box  Parent  in  the  y-direction 

BBP(0,2) 

Minimum  extent  of  the  Bounding  Box  Parent  in  the  z-direction 

BBP(1,2) 

Maximum  extent  of  the  Bounding  Box  Parent  in  the  z-direction 

rs(0) 

X-component  of  the  origin  (  starting  point)  of  the  ray 

rs(l) 

Y-component  of  the  origin  ( starting  point)  of  the  ray 

rs(2) 

Z-component  of  the  origin  (  starting  point)  of  the  ray 

rd(0) 

X-component  of  the  directional  vector  of  the  ray 

rd(l) 

Y-component  of  the  directional  vector  of  the  ray 

rd(2) 

Z-component  of  the  directional  vector  of  the  ray 

tnear 

Distance  of  the  nearest  ray/box  intersection  point  from  the  origin  of  ray 
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The  following  numerical  example  illustrates  the  ray/box  intersection  problem. 
Example 


Given  a  ray  with  origin  [0  4  2]  and  direction  [0.218  -  0.436  0.873]  and  a  box  with 
comers: 

B(minimum)  =  [-1  2  1] 

B(maximum)  =  [3  3  3] 

find  if  the  ray  hits  the  box. 

Here,  rs(0)  =  0,  rs(l)  =  4,  and  rs(2)  =  2; 

rd(0)  =  0.218,  rd(l)  =  -  0.436,  and  rd(2)  =  0.873; 

BBP(0,0)  =  -1,  BBP(0,1)  =  2,  and  BBP(0,2)  =  1, 

BBP(1,0)  =  3,  BBP(1,1)  =  3,  and  BBP(1,2)  =  3. 

The  algorithm  begins  by  looking  at  the  X  slab,  defined  by  BBP(0,0)  and  BBP(1,0).  The 
distances  tl  and  t2  are  computed  as: 

tl  =  [BBP(0,0)  -  rs(0)]  /  rd(0) 

=  [-1  -  0]  /  0.218  =  -  4.59 

t2  =  [BBP(l,0)-rs(0)]/rd(0) 

=  [3-0]/ 0.218  =  13.8 

After  this  stage,  tl  is  set  to  tnear,  and  tfar  to  t2.  Therefore  tnear  becomes  -  4.59,  and  tfar 
13.8.  Neither  tnear  >  tfar  nor  tfar  <  0,  so  the  Y  slab  is  examined.  For  the  Y  slab: 

tl  =  [BBP(0,l)-rs(l)]/rd(l) 

=  [2  -  4]  /  -0.436  =  4.59 

t2  =  [BBP(l,l)-rs(l)]/rd(l) 

=  [3  -  4]  /  -0.436  =  2.29 

Since  tl  >  t2,  swap  these  values.  The  new  tnear  =  2.29,  and  the  new  tfar  =  4.59.  Next,  we 
check  for  the  Z  slab. 

tl  =  [BBP(0,2)  -  rs(2)]  /  rd(2) 

=  [1  -2]/ 0.873  =-1.15 

t2  =  [BBP(l,2)-rs(2)]/rd(2) 

=  [3-2]/ 0.873  =  1.15. 


20-7 


At  this  stage,  tnear  is  not  updated,  and  remains  2.29.  However,  tfar  is  updated  to  1.15. 
Since  tnear  >  tfar,  the  ray  misses  the  box. 


Design  Methodology 

The  following  design  steps  are  outlined  to  implement  the  ray  tracing  algorithm  on  the 
FPGAs: 

1 .  Parallelize  the  algorithm. 

2.  Develop  systolic  arrays  for  the  algorithm. 

3.  Synchronize  the  data  items  maintaining  nearest  neighbor  connection. 

4.  Check  theoretical  output  and  obtain  performance  parameters  (pipelining  period,  etc.). 

5.  Develop  individual  modules  used  in  the  overall  design. 

6.  Interface  the  various  modules  to  obtain  the  complete  design. 

7.  Simulate  and  verify  design. 

8.  Map  design  on  Xilinx  FPGA(s)  using  Xilinx’s  Ml  Foundation  and  other  CAD  tools. 


Proposed  Architecture 

The  proposed  systolic  array  architecture  for  finding  the  ray/box  intersection  is  shown  in 
Figure  2.  It  consists  of  nine  P.E  s  (processing  elements)  connected  in  a  systolic-type 
configuration.  Each  P.E.  contains  a  subtractor  and  a  multiplier.  The  first  row  computes 
the  parameters  tl  and  t2  for  the  X-  component,  as  defined  earlier.  The  second  and  third 
rows  compute  parameters  tl  and  t2  for  the  Y  and  Z  axes.  The  inputs  rs(0),  rs(l),  and 
rs(2),  which  define  the  starting  point  or  origin  of  the  ray,  are  fed  from  the  left-hand  sides 
of  row  1,  row  2,  and  row  3,  respectively.  The  inverses  for  each  of  the  directional  data 
items,  namely,  rd(0),  rd(l),  and  rd(2)  are  first  computed  (not  shown  in  figure).  The 
results  are  then  fed  from  the  right  hand  side  of  the  array  as  shown  in  the  figure.  The  data 
items  which  define  the  minimum  and  maximum  extent  of  the  bounding  box  -  BBP(0,0), 
BBP(1,0),  BBP(0,1),  BBP(1,1),  BBP(0,2),  and  BBP(1,2)  are  fed  from  the  top  of  the 
array.  The  outputs  tl  and  t2,  for  each  of  the  axis,  are  obtained  from  the  bottom  of  the  cells 
as  shown  in  the  figure.  The  outputs  tl  for  each  of  the  axis  are  fed  into  a  comparator 
circuit.  The  maximum  of  the  three  tl  parameters  is  determined.  This  value  is  designated 
as  tnear.  In  a  similar  fashion,  the  minimum  of  the  three  t2  parameters  is  determined  and 
designated  as  tfar.  If  tnear  is  greater  than  tfar,  the  ray  misses  the  bounding  volume  box, 
otherwise,  the  ray  intersects  with  the  box  and  a  hit  is  registered. 
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In  a  real  target  recognition  application,  it  is  expected  that  a  considerably  large  number  of 
rays  will  be  fired  in  parallel  at  the  target.  This  number  could  be  as  high  as  25  million.  In 
order  to  increase  the  speed  of  computations  for  a  large  number  of  rays,  the  architecture 
shown  in  Figure  2  can  be  duplicated.  This  will  reduce  the  time  for  processing  the  rays  to 
half.  However,  the  number  of  processing  elements  will  double  to  18.  If  the  architecture  is 
replicated  one  more  time,  the  number  of  cells  would  increase  to  27.  If  computation  of  the 
‘plandisf  parameter  is  neglected  in  the  architecture,  the  number  of  cells  used  would  be  6, 
12,  and!8  for  the  three  cases  under  discussion. 
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Figure  2  (b):  Systolic  Array,  Time  Snapshot  2 


In  order  to  implement  the  proposed  systolic  architecture,  modules  for  the  following 
components  were  developed  and  tested. 

1 .  Carry  Look-Ahead  Adder 

2.  Wallace  Tree  Multiplier 

3.  Parallel  in-  Parallel  Out  Shift  Right  Register 

4.  Parallel  in-  Parallel  Out  Shift  Left  Register 

5.  Two’s  Complementer 

6.  Adder 


1.  Module:  Carry  Look  -Ahead  Adder 

The  Carry  Look-Ahead  module  developed  for  this  project  is  shown  in  Figure  3.  It  is 
designated  as  CLA2  and  functions  as  a  2-bit  Carry  Look-Ahead  adder  .  The  components 
of  the  CLA2  include  ten  NAND  and  two  Inverter  gates.  The  following  table  summarizes 
the  functions  of  the  various  input  and  output  parameters  of  the  CLA2. 


Input  Parameter 

Function  Description 

A 

Augend 

l_  B 

Addend 

C 

Carry-in 

Output  Parameter 

Function  Description 

O 

Sum 

Carry 

Carry-  out 

SIMULATION  RESULTS: 

The  simulation  results  for  the  carry-look-ahead  adder  are  shown  in  Figure  4,  and  are 
found  to  be  functionally  correct.  For  example,  it  can  be  seen  from  the  results  in  Figure  4 
that  for  the  time  range  100ns  to  200ns,  if  signal  A  =  high,  B  =  low,  and  C  =  high,  then 
the  signal  ‘O’  for  sum’  is  low  and  signal  for  ‘carry’  is  high,  as  expected. 
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XPATCH  RAY  TRACER  MODULE:  2  bits  CLA  Ad< 


Figure  4:  Test  Simulation 


2.  Module:  Wallace  Tree  Multiplier 


The  Wallace  tree  multiplier,  shown  in  Figure  5,  was  designed  for  this  project  because  it  is 
much  faster  than  the  conventional  multipliers.  The  following  table  summarizes  the 
functions  of  the  various  input  and  output  parameters  of  this  multiplier. 


Input  Parameter 

Function  Description 

AfAO  (LSB)  -  A7  (MSB)] 

8  bits  multiplier 

B[B0  (LSB)  -  B7  (MSB)] 

8  bits  multiplicand 

Output  Parameter 

Function  Description 

Q  [QO(LSB)  -  Q15(MSB)] 

16  bits  product 

SIMULATION  RESULTS 

Simulation  results  for  the  Wallace  tree  multiplier  are  shown  in  Figure  6,  and  found  to  be 
functionally  correct.  For  example,  for  the  hexadecimal  number  system,  the  simulation 
results  in  Figure  6  (a)  show  that  when  the  multiplier  and  multiplicand  are  both  80  H  (128 
in  decimal  system),  the  output  is  4000  H  (16384  in  decimal). 

Q  =  A  *  B; 

=  80  (128  in  decimal)  *  80(128  in  decimal) 

=  40000  (16384  in  decimal) 
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Figure  5:  Wallace  Tree  Multiplier 


Figure  6(b):  Test  Simulation  (Bin) 
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3.  Module:  Parallel  in  -  Parallel  out  Shift  Right  Register 

The  SHIFTRE  module,  shown  in  Figure  7,  depicts  our  design  for  an  8  bit  parallel  in  - 
parallel  out  shift  right  register.  The  components  of  this  module  include  eight  D  type  flip 
flops  and  eight  two  input  multiplexers.  The  function  of  this  module  is  to  shift  one  bit  of 
information  to  the  right  at  each  pulse  of  the  clock.  The  following  table  summarizes  the 
functions  of  the  various  inputs  and  outputs  of  this  module. 


Input  Parameter 

Function  Description 

D  [DO  (LSB)  -  D7(MSB)] 

8  bits  input  bus 

CLK 

Clock  function 

ENB 

Enable  function 

RESET 

Reset  function 

SHIFT 

Shift  right  function 

Output  Parameter 

Function  Description 

Q  [QO(LSB)  -  Q7(MSB)] 

8  bits  output  bus 

SIMULATION  RESULTS: 

The  simulation  results  are  shown  in  Figure  8  and  found  to  be  functionally  correct.  For 
example,  for  the  input  pattern  ‘10000000’,  the  output  pattern  in  Figure  8  (b)  is  found  to 
be  01000000,  00100000,  00010000,  and  so  on,  for  successive  clock  cycles.  In  the  hex 
system,  as  shown  in  Figure  8  (a),  the  input  pattern  corresponds  to  80  H,  and  the 
successive  outputs  to  40  H,  20  H,  and  10  H. 
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Figure  8(b):  Test  Simulation  (Bin) 


4.  Module:  Parallel  in  -  Parallel  out  Shift  Left  Register 

The  SHIFTLE  module  is  an  8  bit  parallel  in  -  parallel  out  shift  left  register.  The  following 
table  lists  the  functions  of  the  various  input  and  output  parameters  of  this  module. 


Input  Parameter 

Function  Description 

D  [DO  (LSB)  -  D7(MSB)] 

8  bits  input  bus 

CLK 

Clock  function 

ENB 

Enable  function 

RESET 

Reset  function 

SHIFT 

Shift  left  function 

Output  Parameter 

Function  Description 

Q  [QO(LSB)  -  Q7(MSB)] 

8  bits  output  bus 

SIMULATION  RESULTS: 

The  simulation  results  shown  in  Figure  10  verify  the  functional  operation  of  the  shift  left 
register.  The  simulation  results  are  similar  to  the  previous  case  except  that  the  data  in  the 
input  pattern  are  shifted  left  instead  of  right. 


Shift  Left  Register 


20-26 


Figure  10(a):  Test  Simulation  (Hex) 


5.  Module:  2's  Complement 


The  TWCOM8,  shown  in  Figure  1 1,  is  an  8-bit  2's  complement  module.  The  function  of 
this  module  is  to  convert  an  eight  bit  input  to  the  2’s  complement  form.  The  result  is  used 
for  the  subtraction  process.  The  following  table  lists  the  functions  of  the  various  inputs 
and  outputs  of  this  module. 


Input  Parameter 

Function  Description 

A  [AO  (LSB)  -  A7(MSB)] 

8  bits  input  bus 

Output  Parameter 

Function  Description 

0  [O(LSB)  -  7(MSB)] 

8  bits  output  bus 

SIMULATION  RESULTS: 

The  simulation  results  for  the  2’s  complement  module  are  shown  in  Figure  12,  and  found 
to  be  functionally  correct.  For  example,  it  is  seen  from  Figure  12  that  at  150  ns,  the  input 
is  71  H  and  the  corresponding  output  8F,  as  expected. 
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The  Systolic  Processing  Element: 

The  schematic  for  the  systolic  processing  element  is  shown  in  Figure  13.  The  complete 
schematic  of  the  systolic  array,  corresponding  to  the  architecture  proposed  earlier  in 
Figure  2,  is  shown  in  Figure  14.  As  can  be  seen  from  this  figure,  the  systolic  array 
comprises  of  nine  identical  systolic  processing  units. 

The  systolic  processing  element  comprises  of  the  various  modules  described  in  the 
previous  sections.  The  function  of  the  systolic  processing  element  is  to  compute  the  ‘t’ 
parameter.  The  ‘t’  parameter,  as  explained  earlier,  is  given  by: 

t  =  [BBP  -  rs]  /rd 

In  the  schematic  for  the  systolic  processing  element  shown  in  Figure  13,  input  signal  ‘A’ 
corresponds  to  ‘BBP’,  and  input  ‘B’  corresponds  to  the  parameter  ‘rs’.  The  inverse  of 
‘rd’  is  represented  by  ‘M’,  a  floating  point  number.  Signal  ‘Ein’  represents  the  value  of 
the  exponent  for  the  floating  point  number.  The  following  table  lists  the  functions  of  the 
various  input  and  output  parameters  of  the  systolic  processing  element. 


Input  Parameter 

Function  Description 

A  [A0  (LSB)  -  A7(MSB)] 

8  bits  input  bus 

B  [BO(LSB)  -  B7(MSB)] 

8  bits  input  bus 

EIN1  [EINO(LSB)  -  EIN  1  (MSB)] 

Exponent  inputs 

CLK 

Clock  function 

ENB 

Enable  function 

RESET 

Reset  function 

SHIFT 

Shift  left  function 

Output  Parameter 

Function  Description 

O  [OO(LSB)  -  015(MSB)] 

1 6  bits  output  bus 

EOUT 

[EOUTO(LSB)  -  EOUT  1  (MSB)] 

Exponent  output 

SIMULATION  RESULTS: 

The  simulation  results  for  the  systolic  processing  element  is  shown  in  Figure  15,  and 
found  to  be  functionally  correct.  For  example,  for  the  time  range  225ns  to  300ns,  the 
inputs  are:  A  =  29  H  (41  in  decimal),  B  =  0E  H(15  in  decimal),  M  =  09  (9  in  decimal), 
and  Ein  =  1 .  In  other  words,  we  need  to  compute:  (41  - 15)  *  0.9  =  23.4.  By  checking  the 
simulation  results,  we  verify  the  functional  operation  of  the  processing  element  as 
follows: 
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sum  -  A-B  =  29  H  (41  in  decimal)  -  OE  H(15  in  decimal)  =  1B(26  in  decimal). 
Ein  =  1  ( which  means  that  M  should  be  multiplied  by  0.1) 

M  =  9 

015  =  sum  *  M  =  IB  H(26  in  decimal)  *  09  H  (9  in  decimal) 

=  00F3  H(  234  in  decimal) 

Eout  =  1  (which  means  that  015  equals  to  23.4  in  decimal,  as  expected) 
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Figure  14:  Systolic  Array  Architecture 
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Figure  15(b):  Test  Simulation  (Bin) 


Figure  15(b)  Contd.  :  Test  Simulation  (Bin) 
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Figure  15(b)  Contd.  :  Test  Simulation  (Bin) 


Implementation  Results 


The  implementation  results,  in  terms  of  CLB  (configurable  logic  blocks)  and  IOB  (input 
output  buffers)  utilization  are  shown  in  the  following  tables.  These  results  have  been 
generated  using  the  Xilinx  Ml  Foundation  tools.  It  is  seen  from  the  table,  that  the  systolic 
processing  module  occupies  10  %  of  the  xc4025E  FPGA.  Since  there  are  nine  identical 
processing  elements  in  our  systolic  array  architecture,  the  entire  systolic  array  will 
roughly  occupy  90%  of  the  CLBs  in  the  xc4025E  FPGA.  In  terms  of  IOBs,  it  is  seen  that 
the  systolic  module  requires  18  %  of  the  IOBs  on  the  xc4025E.  A  multiplication  factor  of 
9  (for  the  nine  systolic  processing  elements)  for  the  entire  systolic  array  yields  an 
utilization  factor  of  162  %.  Therefore,  two  daisy  chained  xc4025E  FPGAs  are  required 
for  mapping  the  complete  systolic  array  architecture. 
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DEVELOPMENT  OF  OPTIMUM  DRILLING  PROCESS  FOR 
ADVANCED  COMPOSITES 


Anthony  C.  Okafor 
Associate  Professor 

Department  of  Mechanical  and  Aerospace  Engineering 
and  Engineering  Mechanics 
University  of  Missouri-Rolla 


Abstract 


Extensive  experimental  investigation  was  conducted  to  study  the  effect  of  drilling  conditions 
( speed  and  feedrate),  drill  material  and  drill  point  angle  on  thrust  force,  torque,  acoustic  emission, 
drilled  hole  quality  (hole  size,  roundness,  and  delamination),  and  drill  wear  during  drilling  of 
advanced  composites.  The  overall  objective  of  the  research  is  to  develop  optimum  drilling  process 
and  an  intelligent  monitoring  system  for  predicting  drill  wear  and  drilled  hole  quality  in  advanced 
composites.  A  1/3  fractional  factorial  design  of  experiment  with  4  factors  at  3  levels  was  used  to 
conduct  the  series  of  drilling  experiments.  Acoustic  emission,  cutting  force,  and  cutting  torque  were 
monitored  during  drilling  in  advanced  composite  plates.  The  acquired  sensor  data  along  with 
measured  drill  wear  were  used  to  determine  optimum  drilling  condition  to  develop  smart  sensing 
system  using  neural  network  for  assuring  the  quality  of  drilled  holes  in  advanced  composites.  The 
project  is  divided  into  four  major  tasks.  The  first  task  involved  conducting  series  of  drilling 
experiments  using  the  1/3  fractional  factorial  design  of  experiment,  monitoring,  acquiring,  and 
processing  the  thrust  force,  drilling  torque,  and  acoustic  emission  data,  hole  exit  delamination,  hole 
diameter  and  hole  roundness  produced  during  drilling  in  advanced  composite  plate  (AS4/PEEK). 
The  second  task  involved  characterizing  the  main  effects  and  two  factor  interaction  of  cutting 
conditions  (feedrate  and  speed),  drill  point  angle,  and  drill  tool  materials  on  the  drilling  process 
response  (thrust  force,  drilling  torque,  acoustic  emission,  hole  exit  delamination,  hole  diameter,  and 
hole  roundness),  and  identification  of  optimum  drill  tool  material,  drill  point  angle,  and  cutting 
conditions  for  delamination  free  hole  drilling  in  advanced  composites.  Three  drill  materials  -  High 
Speed  Steel  Cobalt,  Carbide,  and  Polycrystalline  Diamond  (PCD)  were  investigated.  For  the  third 
task,  additional  drilling  experiments  were  conducted  using  the  identified  optimum  drilling  condition 
to  study  drill  wear  and  drill  life  characteristics  for  two  of  the  identified  two  best  drill  materials 
(carbide  and  PCD).  SEM  photographs  of  drilled  specimen  showing  extent  of  delamination  and 
internal  cracks  were  taken  and  used  to  corroborate  the  analysis  from  sensor  data.  The  fourth  task 
involved  the  development  of  neural  network  models  for  predicting  delamination,  drill  wear,  hole 
diameter  and  roundness  based  on  the  integration  of  the  multi-sensor  data.  This  fourth  task  is  the 
subject  for  future  research.  The  composite  plate  material  investigated  is  a  carbon  fiber  reinforced 
composites  (AS4/PEEK)supplied  by  McDonnell  Douglas  Aircraft  Company  in  St.  Louis,  Missouri. 
The  experiment  were  performed  on  a  Cincinnati  Milacron  Sabre  750  Machining  Center. 
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TECHNICAL  INFORMATION 


A.  Summary  of  Results 

One  MS  thesis  on  drilling  of  advanced  composites  and  three  papers  [1-3]  were  obtained  from 
the  research  and  are  in  the  final  preparation  stages  for  publication.  This  research  investigates  the 
effect  of  cutting  condition,  drill  material,  and  drill  point  angle  in  drilling  of  advanced  composites. 
HSS  Cobalt,  Solid  Carbide,  and  Polycrystalline  Diamond  (PCD)  Drills  were  investigated.  In 
reference  [1-2],  a  1/3  Fractional  Factorial  Design  of  Experiment  with  4  factors  at  3  levels  was  used 
to  determine  the  main  effects,  as  well  as  two  factor  interaction  of  drill  point  angle,  cutting  speed, 
feed  rate,  and  drill  material  on  drilling  thrust  force,  drilling  torque  acoustic  emission,  and  drill  hole 
quality  (hole  exit  delamination,  hole  diameter,  and  hole  roundness).  In  reference[  3],  the  optimal 
drilling  conditions  were  selected  to  maximize  hole  quality  and  additional  holes  were  drilled  to  study 
the  effects  of  tool  wear  on  thrust  force,  torque,  and  acoustic  emission.  The  tool  wear  portion  of  the 
experiment  was  performed  using  both  a  solid  carbide  drill  and  a  polycrystalline  Diamond  (PCD) 
Drill.  Thus  the  research  resulted  to  three  major  papers  which  are  now  under  review  for  journal 
publications.  These  three  papers  are  summarized  in  section  E  in  the  form  of  abstract  and 
conclusions. 

B.  Scientific  Collaborators; 

Dr.  A.  C.  Okafor,  Associate  Professor  and  Principal  Investigator 
Dr.  Wen  F.  Lu,  Associate  Professor  and  Co-Principal  Investigator 

Mr.  Scott  Birdsong,  an  MS  student  and  research  assistant  who  will  be  completing  his  degree  at 
the  University  of  Missouri-Rolla  in  August,  1998. 

Mr.  Yalcin  Ertekin,  a  Ph.D.  student  and  research  assistant  who  will  be  completing  his  degree  at  the 
University  of  Missouri-Rolla  in  December,  1998. 

Mr.  Anietie  Ukpong,  a  Ph.D.  student  and  research  assistant  who  will  be  completing  his  degree  at 
the  university  of  Missouri-Rolla  in  December,  1998. 

Mr.  Yang  Yuping,  a  Ph.D.  student  and  research  assistant  who  completed  his  degree  at  the  University 
of  Missouri-Rolla  in  December  1996. 

Mr.  Emmanuel  Ugo  Enemuoh,  a  Ph.D.  student  and  research  assistant  who  will  be  completing  his 
degree  at  the  University  of  Missouri-Columbia. 


21-3 


C.  Publications: 

1.  A.  Chukwujekwu  Okafor  and  Scott  Birsong,  "Effect  of  Drilling  Conditions,  Drill  Material 
and  Point  Angle  on  Thrust  Force  and  Torque  in  Drilling  Advanced  Composite  Material, " 
under  preparation  for  submission  to  the  Journal  of  Manufacturing  Systems:  Special  Issue  on 
Manufacturing  Processes. 

2.  A.  Chukwujekwu  Okafor  and  Scott  Birsong,  "Effect  of  Drilling  Conditions,  Drill  Material 
and  Point  Angle  on  Hole  Quality  and  Delamination  in  Drilling  Advanced  Composite 
Material, "  under  preparation  for  submission  to  the  Journal  of  Manufacturing  Systems:  Special 
Issue  on  Manufacturing  Processes. 

3 .  A.  Chukwujekwu  Okafor  and  Scott  Birsong,  "Effect  of  Tool  Wear  on  Thrust  Force,  Moment 
and  Acoustic  Emission  in  Drilling  Advanced  Composite  Material,"  under  preparation  for 
submission  to  the  Journal  of  Machine  Tools  and  Manufacture. 


D.  Theses  Completed  Acknowledging  the  Award: 

1.  Scott  Birdsong,  MSME,  University  of  Missouri-Rolla,  August,  1998,  "  Machining  of 
Advanced  Composites",  completed. 


E.  Title  and  Abstracts  of  Three  Papers  Resulting  from  the  Research: 


I.  EFFECTS  OF  DRILLING  CONDITIONS,  DRILL  MATERIAL,  AND  POINT  ANGLE 
ON  THRUST  FORCE  AND  TORQUE  IN  DRILLING  ADVANCED  COMPOSITE 

MATERIAL 

Abstract 


A  1/3  fractional  factorial  design  of  experiment  with  4  factors  at  3  levels  was  used  to  study  drilling 
of  advanced  multi-directional  carbon  fiber  reinforced  composite  plate.  Series  of  1/4  inch  holes  were 
drilled  in  a  0.3  inch  thick  AS4/PEEK  composite  plate  as  per  the  1/3  fractional  factorial  design  of 
experiment.  The  plate  was  mounted  on  a  specially  designed  fixture  which  allowed  for  the  acquisition 
of  thrust  force,  drilling  torque  (moment)  from  a  Kistler  9272  four  component  dynamometer.  The 
voltage  proportional  signals  were  separated  via  Kistler  Type  5405 A  junction  box  and  fed  into  separate 
Kistler  Type  5010  dual  mode  amplifiers.  The  amplified  signals  were  ran  through  low  pass  filters  to 
reduce  the  unwanted  noise  from  the  drilling  process.  The  digitized  signals  were  save  and  transferred 
to  a  Pentium  PC  for  further  processing.  Acoustic  emission  signals  generated  by  the  drilling  process  was 
picked  up  using  Digital  Wave  B-1025  broadband  sensors  mounted  on  the  AS4/PEEK  composite  plate 
and  to  the  drilling  fixture.  The  AE  signals  were  passed  through  Digital  Wave  PA2040GA  AE  pre- 
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amplifiers  with  40  dB  gain.  The  voltage  proportional  analog  signals  were  fed  into  a  Digital  wave  F4000 
Fracture  Wave  Detector  where  the  signals  were  filtered  with  band-pass  filters,  digitized  at  a  sampling 
frequency  of  6.25  MHz  with  record  length  of 2048  data  points  and  saved  directly  to  a  Pentium  PC.  The 
Thrust  force,  drilling  torque,  and  acoustic  emission  signals  were  acquired  for  the  entire  length  of  each 
hole  drilling  cycle.  The  design  of  experiment  was  conducted  to  determine  the  main  effects  and  two 
factor  interactions  of:  drill  point  angle,  cutting  speed,  feed  rate,  and  drill  material  on  thrust  force  and 
torque  (drilling  moment)  in  drilling  AS4/PEEK  composite  plate.  Drill  material,  feed  rate,  and  cutting 
speed  were  found  to  have  statistically  significant  effects  on  thrust  force  and  torque.  Drill  material,  feed 
rate  and  cutting  speed  account  for  over  78%  of  the  variation  measured  in  the  maximum  thrust  force  and 
RMS  thrust  force.  Cutting  speed  and  drill  material  were  found  to  be  the  only  statistically  significant 
factors  to  have  an  effect  on  the  maximum  drilling  moment  and  RMS  moment.  Spectral  analysis  of  the 
moment  signal  indicated  that  the  drilling  conditions  produced  no  statistically  significant  effects  in  the 
amplitude  of  the  spectral  density  at  the  cutting  tooth  frequency.  Cutting  speed  was  the  only  significant 
factor  that  contribute  to  the  amplitude  of  the  spectral  density  and  the  spindle  frequency. 

Conclusion 


From  the  results  of  this  research  the  following  conclusions  have  been  drawn: 

•  HS  S/Cobalt  drills  produce  holes  with  the  greatest  amount  of  maximum  thrust  force  and  RMS 
thrust  force,  while  carbide  drills  produce  holes  with  the  least  amount  of  maximum  thrust  force. 
The  maximum  thrust  force  produced  by  PCD  drills  are  a  little  higher  than  that  produced  by  carbide 
drills. 

•  Maximum  thrust  force  and  RMS  thrust  force  increase  with  increasing  feed  rate  and  decreases  with 
an  increasing  cutting  speed  for  all  drill  tool  materials  tested. 

•  Of  the  three  drill  point  angles  of  90°  ,  118°,  and  135°  investigated,  holes  drilled  with  drill  point 
angle  of  1 1 8°  produced  the  highest  maximum  thrust  force,  while  drills  with  drill  point  angles  of 
90°  produced  the  lowest  amount  of  maximum  thrust  force.  Drill  point  angle  of  1 18°  produced 
maximum  thrust  force  slightly  less  than  that  produced  by  drills  withl35°  point  angle. 

•  Drill  point  angle  of  135°  was  found  to  produce  the  largest  RMS  thrust  force  followed  by  118° 
point  angle  for  Carbide  and  PCD  material  drills,  while  drills  with  1 1 8°  point  angle  produced  the 
highest  RMS  thrust  force  when  drilling  with  HSS/Cobalt  drills. 

•  Drill  material,  feed  rate  and  cutting  speed,  contribute  approximately  77%  of  the  variance  measured 
in  the  maximum  thrust  force  and  82%  of  the  variance  measured  in  the  RMS  thrust  force. 

•  In  order  to  minimize  the  maximum  thrust  and  RMS  thrust  force  generated  when  drilling 
AS4/PEEK  composites,  the  drilling  conditions  of  90°  point  angle,  carbide  drill,  feedrate  of  50.8 
mm/min  (2  ipm)  and  spindle  speed  of  6000  rpm  should  be  selected. 
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•  The  effect  of  cutting  speed  on  the  RMS  moment  and  maximum  moment  is  non-linear,  with  the 
greatest  moment  produced  by  holes  drilled  at  4000  rpm.  Holes  drilled  at  2000  rpm  produced  the 
least  moment,  followed  by  6000  rpm. 

•  Carbide  drills  produced  holes  with  the  least  amount  of  RMS  moment  and  maximum  moment 
while  HS  S/Cobalt  material  drills  produce  the  greatest  RMS  moment  and  maximum  moment. 

•  Spindle  speed  and  drill  material  contributed  to  approximately  50%  of  the  variance  measured  in 
the  RMS  moment  and  36%  of  the  variance  measured  in  the  maximum  moment. 

•  Spindle  speed  contributed  14%  of  the  variance  measured  in  the  amplitude  of  the  spectral  density 
at  the  spindle  frequency. 

•  The  drilling  conditions  were  found  to  have  no  significant  effect  on  the  magnitude  of  the  spectral 
density  at  the  tooth  frequency. 


II.  EFFECTS  OF  DRILLING  CONDITIONS,  DRILL  MATERIAL  AND  POINT  ANGLE 
ON  HOLE  QUALITY  AND  DELAMINATION  IN  DRILLING  ADVANCED 

COMPOSITE  MATERIAL. 


Abstract 


A  1/3  fractional  factorial  design  of  experiment  with  4  factors  at  3  levels  was  used  to  study  drilling 
of  advanced  multi-directional  carbon  fiber  reinforced  composites.  Series  of  1/4  inch  holes  were  drilled 
in  a  0.3  inch  thick  AS4/PEEK  composite  plate  as  per  the  1/3  fractional  factorial  design  of  experiment. 
The  plate  was  mounted  on  a  specially  designed  fixture  which  allowed  for  the  acquisition  of  thrust  force, 
drilling  torque  (moment)  from  a  Kistler  9272  four  component  dynamometer.  The  voltage  proportional 
signals  were  separated  via  Kistler  Type  5405A  junction  box  and  fed  into  separate  Kistler  Type  5010 
dual  mode  amplifiers.  The  amplified  signals  were  ran  through  low  pass  filters  to  reduce  the  unwanted 
noise  from  the  drilling  process.  The  digitized  signals  were  save  and  transferred  to  a  Pentium  PC  for 
further  processing.  Acoustic  emission  signals  generated  by  the  drilling  process  was  picked  up  using 
Digital  Wave  B-1025  broadband  sensors  mounted  on  the  AS4/PEEK  composite  plate  and  to  the  drilling 
fixture.  The  AE  signals  were  passed  through  Digital  Wave  PA2040GA  AE  pre-amplifiers  with  40  dB 
gain.  The  voltage  proportional  analog  signals  were  fed  into  a  Digital  wave  F4000  Fracture  Wave 
Detector  where  the  signals  were  filtered  with  band-pass  filters,  digitized  at  a  sampling  frequency  of  6.25 
MHz  with  record  length  of  2048  data  points  and  saved  directly  to  a  Pentium  PC.  The  Thrust  force, 
drilling  torque,  and  acoustic  emission  signals  were  acquired  for  the  entire  length  of  each  hole  drilling 
cycle.  The  design  of  experiment  was  conducted  to  determine  the  main  effects  and  two  factor 
interactions  of:  drill  point  angle,  cutting  speed,  feed  rate,  and  drill  material  on  hole  quality  and 
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delamination  in  the  drilling  of  AS4/PEEK  composite  plate.  The  two  factor  interactions  of  point  angle 
and  drill  tool  material,  speed  and  drill  tool  material,  as  well  as  the  main  effect  of  drill  point  angle  were 
found  to  have  statistically  significant  effect  on  hole  diameter.  Drill  tool  material,  point  angle  and  drill 
tool  material  interaction,  and  the  main  effect  of  drill  point  angle  were  found  to  have  a  statistically 
significant  effect  on  hole  circularity.  Drill  tool  material  was  found  to  be  the  most  statistically 
significant  factor  and  accounted  for  over  50%  of  the  variation  measured  in  the  delamination  of  the  exit 
hole  side  of  the  composite  material. 

Conclusion: 

From  the  results  of  this  research,  the  following  conclusions  have  been  drawn: 

•  PCD  drills  produce  the  most  consistent  size  holes  throughout  the  different  ranges  of  cutting 
speeds  and  drill  point  angles. 

•  Increasing  the  drill  point  angle  has  a  tendency  to  produce  nominal  size  holes,  while  the  effects 
of  feed  rate  are  found  statistically  insignificant. 

•  The  drilling  conditions  of  1 18°  point  angle,  PCD  drill  material,  50.8  mm/min  (2ipm)  and  2000 
rpm  should  be  selected  to  produce  nominal  size  holes  in  drilling  AS4/PEEK. 

•  Carbide  tools  were  found  to  produce  drilled  holes  with  the  least  amount  of  circularity  error 
through  out  all  ranges  of  drill  point  angles  tested  followed  by  PCD  drills,  while  HSS/Cobalt 
drills  produced  holes  with  the  greatest  amount  of  circularity. 

•  Drills  with  118°  drill  point  angles  were  found  to  produce  the  truest  holes  for  all  materials 
measured. 

•  The  effects  of  feed  rate  and  cutting  speed  on  the  circularity  measurement  were  found  to  be 
statistically  insignificant. 

•  The  drilling  conditions  of  135°  point  angle,  carbide  drill  with  moderate  cutting  speeds  and  feed 
rates  should  be  selected  to  produce  hole  with  the  least  amount  of  circularity  error  in  drilling 
AS4/PEEK. 

•  Drill  material  selection  was  found  to  have  the  most  significant  effect  on  exit  hole  delamination 
in  drilling  AS4/PEEK  composites. 

•  Carbide  drills  were  found  to  produce  the  least  amount  of  exit  delamination  throughout  all 
ranges  of  speed  followed  by  PCD  drills,  while  HSS/Cobalt  drills  where  found  to  produce  holes 
with  the  greatest  amount  of  hole  exit  delamination. 

•  High  feed  rates  produce  the  least  amount  of  exit  delamination  when  using  drills  with  a  drill 
point  angle  of  118°  orl35°. 


21-7 


•  The  drilling  conditions  of  1 1 8°  point  angle,  carbide  drill  material,  1 52  mm/min  (6ipm)  and 
4000  rpm  should  be  selected  to  minimize  exit  delamination  in  drilling  AS4/PEEK. 

•  The  higher  the  point  angle  the  smaller  the  exit  hole  delamination  for  all  drill  tool  materials. 
Point  angle  of  90°  gave  the  highest  delamination. 


III.  EFFECTS  OF  TOOL  WEAR  ON  THRUST  FORCE,  TORQUE  AND  ACOUSTIC 
EMISSION  IN  DRILLING  ADVANCED  COMPOSITE  MATERIAL 


Abstract 


Additional  experimental  investigation  was  conducted  using  the  optimum  drilling  conditions 
identified  in  the  previous  research  (previous  two  papers)  to  study  drill  wear  and  drill  wear 
characteristics  for  the  identified  two  best  drill  tool  materials  (carbide  and  PCD  drills).  The  effects 
of  drill  wear  on  thrust  force,  cutting  torque  and  acoustic  emission  for  both  carbide  and  PCD  drills 
when  drilling  the  advanced  composite  plate  (AS4/PEEK)  were  investigated.  SEM  photographs  of 
drilled  specimen  showing  extent  of  delamination  and  internal  cracks  were  taken  and  used  to 
corroborate  the  analysis  from  sensor  data  and  measured  tool  wear.  Drilling  conditions  were  selected 
to  minimize  exit  hole  delamination  for  carbide  drill  and  a  series  of  80  holes  were  drilled  for  both 
carbide  and  PCD  drills.  The  thrust  force,  drilling  torque,  acoustic  emission  were  acquired  for  the 
entire  length  of  drilling  cycle  for  each  hole.  Carbide  drill  was  found  to  wear  more  that  PCD  drill. 
Maximum  and  RMS  thrust  force  were  found  to  increase  with  tool  wear.  The  maximum  drilling 
torque  was  found  to  increase  with  drill  wear  for  the  first  45  holes  drilled  after  which  it  began  to 
decrease  for  both  Carbide  and  PCD.  No  correlation  was  found  between  the  RMS  of  drilling  torque, 
the  spectral  density  amplitude  of  the  torque  signal  at  tooth  frequency  with  drill  wear.  The  average 
acoustic  emission  (AE)  energy  as  well  as  the  total  AE  energy  was  found  to  decrease  with  tool  wear. 


